Glycosyltransferase vectors for treating cancer

ABSTRACT

This disclosure provides a system for specifically killing cancer cells which can be used in the course of human therapy. Vectors of the invention comprise an encoding sequence for a glycosyltransferase, under control of a tumor or tissue specific transcriptional control element, such as the promoter for telomerase reverse transcriptase. Exemplary glycosyltransferases are the A or B transferase enzymes, which cause the cancer cells to express ABO histo blood group allotypes against which humans have naturally occurring antibody. This provides for ongoing surveillance for newly emerging cells with a malignant phenotype.

REFERENCE TO RELATED APPLICATION

[0001] This application claims priority to U.S. patent application Ser. No. 60/253,395; filed Nov. 27, 2000, pending. The priority application is hereby incorporated herein by reference in its entirety.

TECHNICAL FIELD

[0002] This invention relates generally to the field of virology and cancer therapy. This disclosure provides vectors in which an encoding region for glycosyltransferase is linked to a genetic element that controls transcription in a tumor or tissue specific fashion.

BACKGROUND

[0003] Many forms of cancer are intractable to traditional courses of radiation or small molecule pharmaceuticals. Considerable interest has evolved in developing gene therapy vectors as chemotherapeutic agents.

[0004] A broad variety of therapeutic genes are currently under investigation in preclinical and in clinical studies (Walther et al., Mol. Biotechnol. 13:21, 1999). The candidate genes have very different origins and different mechanisms of action—which include cytokine genes, genes coding for immunostimulatory molecules/antigens, genes encoding bacterial or viral prodrug-activating enzymes (suicide genes), and tumor suppressor genes.

[0005] Some of the putative vectors are based on adenovirus. U.S. Pat. Nos. 5,631,236 and 6,096,718 (Baylor College of Medicine) cover a method of causing regression in a solid tumor, using a vector containing an HSV thymidine kinase (tk) gene, followed by administration of a prodrug such as ganciclovir. U.S. Pat. No. 6,096,718 (Baylor College of Medicine) relates to the use of a replication incompetent adenoviral vector, comprising an HSV tk gene under control of the α-lactalbumin promoter.

[0006] U.S. Pat. Nos. 5,801,029 and 5,846,945 (Onyx Pharmaceuticals) relate to adenovirus in which the E1a gene has been altered so as not to bind and inactivate tumor suppressor p53 or RB. This prevents the virus from inactivating tumor suppression in normal cells, which means the virus cannot replicate. However, the virus will replicate in cells that have shut off p53 or RB expression through oncogenic transformation.

[0007] U.S. Pat. No. 5,998,205 (GTI/Novartis) pertains to a tissue-specific replication-conditional adenovirus, comprising a transcriptional regulatory sequence (such as the α-fetoprotein promoter) operably linked to adenovirus early replication gene. U.S. Pat. No. 5,698,443 (Calydon) provides replication-conditional adenoviruses controlled by the PSA promoter. Alemany et al. (Cancer Gene Ther. 6:21, 1999) outline complementary adenoviral vectors for oncolysis. One vector contains cis replication elements and E1a under control of a tissue-specific promoter. The supplemental vector contains all other trans-acting adenovirus replication genes. Coinfection leads to controlled killing of hepatocarcinoma cells.

[0008] International Patent Publication WO 98/14593 (Geron) describes an adenovirus construct in which the tk gene is placed under control of the promoter for telomerase reverse transcriptase (TERT). This gene is expressed at high levels in cancer cells of any tissue type, and the vector renders cancer cell lines susceptible to toxic effects of ganciclovir. WO 00/46355 (Geron) describes an oncolytic virus having a genome in which a TERT promoter is linked to a genetic element essential for replication or assembly of the virus, wherein replication of the virus in a cancer cell leads to lysis of the cancer cell.

[0009] Koga et al. (Hum. Gene Ther. 11:1397, 2000) propose a telomerase-specific gene therapy using the hTERT gene promoter linked to the apoptosis gene Caspase-8 (FLICE). Gu et al. (Cancer Res. 60:5359, 2000) reported a binary adenoviral system that induced Bax expression via the hTERT promoter. They found that it elicited tumor-specific apoptosis in vitro and suppressed tumor growth in nude mice.

[0010] Other vectors are based on herpes family viruses, such as herpes simplex type 1 and 2. U.S. Pat. No. 5,728,379 (Georgetown University) relates to replication competent HSV containing a transcriptional regulatory sequence operatively linked to an essential HSV gene. Exemplary is the ICP4 gene under control of the pro-opiomelanocortin promoter.

[0011] Other vectors are based on the retrovirus family. U.S. Patent 5,997,859 and EP 702084 B1 (Chiron) pertain to replication-defective recombinant retrovirus, carrying a vector construct capable of preventing, inhibiting, stabilizing or reversing infections, cancer, or autoimmune disease. The virus directs expression of an enzyme not normally expressed in the cells that converts a compound into a cytotoxic form. Exemplary is the HSV tk gene. WO 99/08692 proposes the use of reovirus in treating cancer, particularly ras-mediated neoplasms.

[0012] These proposed therapeutic agents are not currently approved for commercial use in the United States. There is a need to develop new constructs to improve efficacy and specificity of cancer treatment.

SUMMARY OF THE INVENTION

[0013] This invention provides a system for killing cancer cells in vitro or in vivo, using a polynucleotide encoding a glycosyltransferase under control of a tumor specific or tissue specific transcriptional control element. The glycosyltransferase typically forms a determinant on the cell surface to which some or all humans have naturally occurring antibody. In this manner, cancer cells will be culled on an ongoing basis by antibody already present in the circulation, without the need to follow the vector with an effector agent.

[0014] One embodiment of the invention is a polynucleotide as already described. Suitable glycosyltransferase enzymes include but are not limited to histo blood group A or B transferase from any upper primate (particularly human), and α(1,3)galactosyltransferase (α1,3GT) of any mammal that forms the Galα(1,3)Gal xenoantigen.

[0015] The transcriptional control element can be a tissue specific promoter, as exemplified below. Alternatively, the control element can be a tumor specific promoter, as exemplified below. Of particular interest is the promoter for telomerase reverse transcriptase (SEQ. ID NO:1). The polynucleotide can take the form of a viral vector (for example, adenovirus, herpes virus, or retrovirus), naked DNA, or a lipid composition (for example, a neutral or anionic lipid envelope, or a cationic liposome or micelle) that has a DNA or RNA component.

[0016] Polynucleotides of the invention can be used to prepare a medicament for human treatment, especially for conditions associated with hyperproliferation, such as cancer and other neoplasias.

[0017] Another embodiment of the invention is a polypeptide with glycosyltransferase activity, which comprises a consensus of mammalian α1,3GT sequences, or a humanized α1,3GT sequence, or catalytic subfragment thereof.

[0018] Also provided is a method of killing a cancer cell, comprising combining the cancer cell with a polynucleotide as already described. The invention includes a system for testing and manufacturing the glycosyltransferase vectors of this invention. The invention can be used for treating cancer in a subject by administering to the subject a polynucleotide as already described.

[0019] Other embodiments of the invention will be apparent from the description that follows.

DRAWINGS

[0020] FIG. 1 is a map of adenovirus vector designated pGRN376, in which the promoter for telomerase reverse transcriptase (TERT) controls expression of the tk gene (Example 1).

[0021] FIG. 2 is a photographic reproduction showing the effects of replication-conditional adenovirus on normal and cancer-derived cell lines (Example 2).

[0022] FIG. 3 is a sequence listing comparing the human blood group A and B transferase amino acid sequences with α(1,3)galactosyltransferase (α1,3GT) of other species. A consensus version and a humanized version of α1,3GT are shown as SEQ. ID NOs:12 & 13. (−) represents a sequence gap; (.) indicates a residue identical with the aligned marmoset α1,3GT sequence (Example 3). Other sequences shown in this figure are listed in Table 2.

[0023] FIG. 4 is a sequence listing comparing the marmoset α1,3GT encoding sequence with the human α1,3GT pseudogene. The humanized α1,3GT encoding sequence is shown as SEQ. ID NO:16 (Example 3). The sequences shown in this figure are listed in Table 2.

DETAILED DESCRIPTION

[0024] A long-sought objective in cancer treatment is to design a therapeutic agent that effectively kills cancer cells wherever they are located, while sparing other cells in the vicinity that do not bear the malignant phenotype.

[0025] The invention described in this disclosure solves the problem by providing a therapeutic vector that encodes an enzyme that forms a target molecule on the cell surface that can be targeted by antibody in situ. Particularly effective are so-called natural antibodies that recognize features of foreign complex carbohydrates. A number of naturally occurring anti-carbohydrate antibodies are present in the circulation of humans without deliberate immunization. It is thought that these antibodies arise from cross-reacting mucins and other carbohydrate-bearing substances that people are routinely exposed to through their diet.

[0026] In one aspect of this invention, the carbohydrate targets are produced in greater abundance on tumor cells, because expression of the enzyme that makes the target is controlled by a transcriptional control element that is tumor or tissue specific. Tumor-specific targeting relies on control elements taken from genes expressed predominantly in cells that undergo repeated proliferation, or that are relatively undifferentiated. Such vectors are effective for treating a wide variety of tumor types at the primary site or elsewhere. Tissue-specific targeting relies on control elements taken from genes expressed in particular tissue types. Such vectors are especially useful for treating metastases, or tumors in which the tissue-specific element is relatively more abundant.

[0027] Treatment is effected by administering the vector systemically or locally so that it can migrate to and transfect the tumor cells causing the disease. The vector then causes expression of the new carbohydrate structure at the cell surface. This becomes a target for antibody in the circulation (or other components of the immune system, such as cytotoxic T cells, ADCC cells, or T helper/inducer cells), which in turn leads to a number of possible effects: complement-mediated lysis, opsonization, cytotoxic killing, cytokine and interferon secretion, and inflammatory response.

[0028] This system is believed to offer two advantages over previous approaches to gene therapy for cancer.

[0029] The first advantage is that it can provide ongoing surveillance against the emergence of new malignancies. This is available when using a tumor-specific expression vector, such as the TERT promoter described below, and when the vector is capable of replication or remains expressible by the cell. In cancer cells, the vector will cause expression of the target carbohydrate, causing them to be recognized and eliminated by antibody. In cells that are not actively malignant, the vector will remain quiescent—until such time as the cell reverts to the cancer phenotype—whereupon the target carbohydrate will be expressed de novo, and the cell becomes eliminated in its turn. Since naturally occurring antibody is persistently available, there is no need to readminister an effector drug to eradicate any newly activated cancer cells.

[0030] The second advantage is that glycosyltransferases potentially provide a second level of specificity for malignant cells. In using tumor-specific promoters to drive gene expression, there is at least a theoretical concern that the vector may also have an effect on non-cancerous cells that up-regulate the promoter transiently as part of the normal replicative process of the cell. For example, TERT is expressed transiently by some actively growing stem cells, lymphocytes, and germinal tissue.

[0031] The potential second layer of specificity provided by glycosyltransferase is related to the density of carbohydrate determinants on the surface of certain types of progenitor cells. Immune lysis of cells through glycolipid antigen depends primarily on IgG antibody. The IgG molecule must span two antigenic determinants with its two combining sites in order to activate complement—binding to only one determinant (termed monogamous bivalency) is insufficient. This means there is a minimum density of determinants that must be present in order for the antibody to activate complement.

[0032] Fetal red cells bear a low density of ABO blood group determinants, attributable to paucity of branches in the oligosaccharide. This means that ABO blood group IgG antibodies can only bind monogamously (Romans et al., J. Immunol. 124:2807, 1980). If other fetal and embryonic cells express the branching enzyme in the same limited fashion, then they may also be less susceptible to complement lysis mediated by antibodies directed against any part of the same complex carbohydrate.
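The density requirement can be illustrated with a rough calculation. The sketch below assumes that determinants are scattered at random (a Poisson distribution) over the membrane, and that an IgG molecule can bridge two determinants only if they lie within a fixed reach of each other. Both the random-placement model and the default reach of 12 nm are illustrative assumptions, not values taken from this disclosure.

```python
import math


def fraction_bivalent_capable(density_per_um2: float, reach_nm: float = 12.0) -> float:
    """Fraction of determinants with at least one neighbor within `reach_nm`.

    Assumes determinants are placed at random (Poisson), so the chance of
    finding no neighbor inside a circle of radius r is exp(-density * pi * r^2).
    `reach_nm` is a hypothetical IgG span, chosen only for illustration.
    """
    if density_per_um2 < 0 or reach_nm <= 0:
        raise ValueError("density must be >= 0 and reach must be > 0")
    r_um = reach_nm / 1000.0  # convert nanometers to micrometers
    expected_neighbors = density_per_um2 * math.pi * r_um ** 2
    return 1.0 - math.exp(-expected_neighbors)


if __name__ == "__main__":
    # Hypothetical densities (determinants per square micrometer).
    for density in (100, 1_000, 10_000):
        frac = fraction_bivalent_capable(density)
        print(f"{density:>6} per um^2 -> {frac:.1%} can be bridged")
```

Under these assumptions the bridgeable fraction falls off steeply as density drops, which is the qualitative behavior the rationale above relies on.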

[0033] This theoretical rationale is provided to enhance the reader's appreciation of the invention. Those skilled in the art will appreciate that there are other advantages in the invention beyond those indicated above. This explanation is not meant to limit the claimed invention in any way.

[0034] Further explanation of the making and use of the vector constructs of the invention is provided in the sections that follow.

[0035] Definitions

[0036] The term “polynucleotide” refers to a polymeric form of nucleotides of any length. Included are genes and gene fragments, mRNA, tRNA, rRNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA and RNA, nucleic acid probes, and primers. As used in this disclosure, the term polynucleotide refers interchangeably to double- and single-stranded molecules. Unless otherwise specified or required, any embodiment of the invention that is a polynucleotide encompasses both a double-stranded form, and each of the two complementary single-stranded forms known or predicted to make up the double-stranded form.

[0037] A cell is said to be “genetically altered”, “transfected”, or “genetically transformed” when a polynucleotide has been transferred into the cell by any suitable means of artificial manipulation, or where the cell is a progeny of the originally altered cell that has inherited the polynucleotide. The polynucleotide will often comprise a transcribable sequence encoding a protein of interest, which enables the cell to express the protein at an elevated level. The genetic alteration is said to be “inheritable” if progeny of the altered cell have the same alteration.

[0038] A “control element” or “control sequence” is a nucleotide sequence that contributes to the functional regulation of a polynucleotide, such as replication, duplication, transcription, splicing, translation, or degradation of the polynucleotide. Transcriptional control elements include promoters, enhancers, and repressors.

[0039] Particular gene sequences referred to as promoters, like the “TERT promoter”, or the “PSA promoter”, are polynucleotide sequences derived from the gene referred to that promote transcription of an operatively linked gene expression product. It is recognized that various portions of the upstream and intron untranslated gene sequence may in some instances contribute to promoter activity, and that all or any subset of these portions may be present in the genetically engineered construct referred to. The promoter may be based on the gene sequence of any species having the gene, unless explicitly restricted, and may incorporate any additions, substitutions or deletions desirable, as long as the ability to promote transcription in the target tissue is retained. Genetic constructs designed for treatment of humans may comprise a segment that is at least 90% identical to a promoter sequence of a human gene. A particular sequence can be tested for activity and specificity, for example, by operatively linking to a reporter gene (Example 1).

[0040] Genetic elements are said to be “operatively linked” if they are in a structural relationship permitting them to operate in a manner according to their expected function. For instance, if a promoter helps initiate transcription of the coding sequence, the coding sequence can be referred to as operatively linked to (or under control of) the promoter. There may be intervening sequence between the promoter and coding region so long as this functional relationship is maintained.

[0041] In the context of encoding sequences, promoters, and other gene elements, the term “heterologous” indicates that the element is derived from a genotypically distinct entity from that of the rest of the entity to which it is being compared. For example, a promoter or gene introduced by genetic engineering techniques into a context in which it does not occur in nature is said to be a heterologous polynucleotide. An “endogenous” genetic element is an element that is in the same place in the chromosome where it occurs in nature, although other gene elements may be artificially introduced into a neighboring position.

[0042] The terms “polypeptide”, “peptide” and “protein” are used interchangeably to refer to polymers of amino acids of any length. The polymer may comprise modified amino acids, it may be linear or branched, and it may be interrupted by non-amino acids.

[0043] The term “antibody” as used in this disclosure refers to both polyclonal and monoclonal antibody. The ambit of the term deliberately encompasses not only intact immunoglobulin molecules, but also such fragments and genetically engineered derivatives of immunoglobulin molecules, T cell receptors, and their equivalents as may be prepared by techniques known in the art, and which retain binding specificity of the antigen combining site.

[0044] General Techniques

[0045] Methods in molecular genetics and genetic engineering are described generally in the current editions of Molecular Cloning: A Laboratory Manual (Sambrook et al.); Oligonucleotide Synthesis (M. J. Gait, ed.); Animal Cell Culture (R. I. Freshney, ed.); Gene Transfer Vectors for Mammalian Cells (Miller & Calos, eds.); Current Protocols in Molecular Biology and Short Protocols in Molecular Biology, 3rd Edition (F. M. Ausubel et al., eds.); and Recombinant DNA Methodology (R. Wu ed., Academic Press). Reagents, cloning vectors, and kits for genetic manipulation referred to in this disclosure are available from commercial vendors such as BioRad, Stratagene, Invitrogen, and ClonTech.

[0046] For a description of the molecular biology of cancer, the reader is referred to Principles of Molecular Oncology (M. H. Bronchud et al. eds., Humana Press, 2000); The Biological Basis of Cancer (R. G. McKinnel et al. eds., Cambridge University Press, 1998); and Molecular Genetics of Cancer (J. K. Cowell ed., Bios Scientific Publishers, 1999).

[0047] General techniques for the development, testing, and administration of biomolecular chemotherapeutics are provided in Gene Therapy of Cancer, Adv. Exp. Med. Biol. vol. 451 (P. Walden ed., Plenum Publishing Corp., 1998); Cancer Gene Therapy, Adv. Exp. Med. Biol. vol. 465 (N. A. Habib ed., Kluwer Academic Pub, 2000); and Gene Therapy of Cancer: Methods and Protocols, Meth. Mol. Med. vol. 35 (W. Walther & U. Stein eds., Humana Press, 2000).

[0048] Effector Genes for Tumor Cell Depletion

[0049] The vectors of this invention comprise an encoding region that forms a carbohydrate determinant on the cell surface as a target for cancer cell lysis.

[0050] Exemplary are glycosyltransferases that synthesize an alloantigen or xenoantigen widely expressed on different tissue types.

[0051] In humans, an α(1,2)fucosyltransferase uses N-acetyl lactosamine acceptor groups on cell surface glycoproteins and glycolipids to form Fucα(1,2)Galβ(1,4)GlcNAc, which is blood group H substance. This in turn serves as an acceptor for the ABO histo blood group transferases, which form terminal allodeterminants on the complex carbohydrate. Blood group A transferase adds GalNAc to form GalNAcα(1,3)Gal (A substance). Blood group B transferase adds Gal instead to form Galα(1,3)Gal (B substance).

[0052] According to the blood group of an individual, one or both of these transferases are expressed in essentially all nucleated cells, resulting in expression of A and B substance on the cell surface. Red cells also abundantly present A and B substance, by virtue of synthesis before enucleation, and subsequent adsorption of glycolipids from plasma. Naturally occurring antibodies circulate in the blood that react against the ABO determinants that are not self-antigens. One advantage of using an ABO transferase as the effector sequence is that the H precursor substance will be available on the surface membrane of virtually any tumor.
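The relationship between a subject's blood group and the determinants recognized by that subject's natural antibody can be written out as a simple lookup. The following is a minimal sketch of that logic only; the function names are hypothetical, and the table reflects the standard ABO rule that natural antibody is present against whichever of the A and B determinants is not a self-antigen.

```python
# Determinants expressed as self-antigens for each ABO blood group.
SELF_ANTIGENS = {
    "O": set(),
    "A": {"A"},
    "B": {"B"},
    "AB": {"A", "B"},
}


def natural_antibody_targets(blood_type: str) -> set:
    """Return the ABO determinants that are NOT self-antigens for this type.

    These are the determinants against which naturally occurring antibody
    is expected to circulate.
    """
    try:
        return {"A", "B"} - SELF_ANTIGENS[blood_type]
    except KeyError:
        raise ValueError(f"unknown ABO type: {blood_type!r}") from None


def candidate_abo_transferases(blood_type: str) -> list:
    """ABO transferases whose product would be foreign in this subject."""
    return sorted(natural_antibody_targets(blood_type))


if __name__ == "__main__":
    for abo in ("O", "A", "B", "AB"):
        print(abo, "->", candidate_abo_transferases(abo) or "none")
```

On this logic a type O subject has antibody to both products, a type A or type B subject has antibody to one, and a type AB subject has antibody to neither A nor B substance.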

[0053] The nucleotide and protein sequence of A transferase and B transferase are provided below. See also U.S. Pat. Nos. 5,068,191 and 5,326,857. The two enzymes are close homologs of each other, differing by only a few amino acids. Another advantage of using an ABO transferase as the effector sequence is that the expressed protein is of human origin, and unlikely to be immunogenic by virtue of its similarity to another gene product expressed as a self antigen in the patient being treated.

[0054] Mammals other than humans, apes and Old World monkeys do not form H precursor substance, but instead convert the N-acetyl lactosamine acceptor into the Galα(1,3)Gal determinant. Galα(1,3)Gal epitope is expressed prominently on the surface of nucleated cells, including hepatic cells, renal cells, and vascular endothelium—and is the main target for the natural antibodies mediating xenograft rejection (reviewed by Joziasse et al., Biochim. Biophys. Acta 1455:403, 1999; Sandrin et al., Frontiers Biosci. 2:31,1997).

[0055] The Galα(1,3)Gal epitope is made by a specific enzyme, α(1,3)galactosyltransferase (α1,3GT). In humans and other primates that do not express the Galα(1,3)Gal product, the α1,3GT locus is inactivated (Galili et al., Proc. Natl. Acad. Sci. USA 15:7401, 1991). There are frameshift and nonsense mutations within the locus, turning it into a non-functional, processed pseudogene (Larsen et al., J. Biol. Chem. 265:7055, 1990; Joziasse et al., J. Biol. Chem. 266:6991, 1991).

[0056] For use in this invention, α1,3GT of any species can be used. A number of α1,3GT sequences are provided below. For use in human therapy, it may be beneficial to use an α1,3GT that differs as little as possible from the human pseudogene sequence, while retaining the same specificity. The complete marmoset α1,3GT sequence is provided below, and can be humanized by substituting residues from the human pseudogene that do not alter the binding or catalytic site. If desired, glycosyltransferases can also be truncated down to the minimal size of the catalytically active enzyme (Henion et al., Glycobiology 4:193, 1994).
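The humanizing substitution described above can be sketched as a position-by-position merge of two aligned sequences. The example below is an illustration only: the function name, the toy sequences, and the protected position are hypothetical, and in practice the protected positions would have to be chosen from knowledge of the binding and catalytic site. The sketch assumes the two sequences are already aligned to equal length, with "-" marking a gap and "*" marking a stop codon in the pseudogene translation.

```python
def humanize(active_seq: str, pseudogene_seq: str, protected: set) -> str:
    """Substitute pseudogene residues into an active enzyme sequence.

    active_seq     -- aligned sequence of the functional enzyme
    pseudogene_seq -- aligned translation of the human pseudogene
    protected      -- 0-based alignment positions that must keep the
                      active-enzyme residue (e.g. catalytic site)

    The active residue is also kept wherever the pseudogene has a gap or
    stop codon, or the active sequence itself has a gap.
    """
    if len(active_seq) != len(pseudogene_seq):
        raise ValueError("sequences must be aligned to the same length")
    out = []
    for i, (a, h) in enumerate(zip(active_seq, pseudogene_seq)):
        keep_active = i in protected or h in "-*" or a == "-"
        out.append(a if keep_active else h)
    return "".join(out)


if __name__ == "__main__":
    # Toy aligned fragments, invented for illustration only.
    active = "MNVKGKVILS"
    human = "MNAKG*VVLS"
    print(humanize(active, human, protected={7}))
```

Any sequence produced this way would still need to be tested for retained enzymatic activity and specificity.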

[0057] Other glycosyltransferases can also be identified for use in this invention. Candidates include transferases responsible for other carbohydrate blood group alloantigens (for example, Lewis, P, Ii blood groups). Candidates also include non-mammalian glycosyltransferases, and transferases responsible for making determinants present on embryonic cells of humans and other species that are not found on most adult cells.

[0058] The choice of a particular transferase may involve a number of considerations and routine empirical testing. One consideration is the density of determinants formed on transfected cells. As discussed earlier, certain glycosyltransferases may synthesize a lower density of determinants on stem cells by virtue of the relative paucity of branched precursor substances on those cells. By judicious selection of the transferase, it may be possible to titrate the density of determinants formed. For example, A- and B-transferases will have exclusive access to H substance if transfected into type O cells, or will compete 1:1 with each other as counterparts. α1,3GT is expected to produce less determinant, because it must compete in humans with the α(1,2)fucosyltransferase that forms H substance. It has been found that α1,3GT fares less well in this competition because of its position in the Golgi, which in turn is a function of the N-terminal membrane-anchoring domain. It is possible to switch the α(1,2)fucosyltransferase cytoplasmic domain onto α1,3GT in order to increase the density of Galα(1,3)Gal epitopes produced (Osman et al., J. Biol. Chem. 271:33105, 1996).

[0059] Transcriptional Control Elements for Tumor Targeting

[0060] The control element is selected with a view to the protein expression patterns in cancer cells compared with non-malignant cells that will also be exposed to the vector.

[0061] Many tumor-specific transcriptional control elements can be used in this invention. These control elements cause elevated transcription of the encoding sequence they are linked to in tumor cells of a variety of different types. Examples are promoters that control telomerase reverse transcriptase (TERT), carcinoembryonic antigen (CEA), hypoxia-responsive element (HRE), autocrine motility factor receptor (Grp78), L-plastin, and hexokinase II.

[0062] The promoter for TERT is exemplary. Sequence of the human TERT gene (including upstream promoter sequence) is provided below. The reader is also referred to U.K. Patent GB 2321642 B (Cech et al., Geron Corporation and U. Colorado), International Patent Publications WO 00/46355 (Morin et al., Geron Corporation), WO 99/33998 (Hagen et al., Bayer Aktiengesellschaft), and Horikawa et al. (Cancer Res., 59:826, 1999). Other TERT sequences can also be used; the mouse sequence is provided in WO 99/27113 (Morin et al., Geron Corporation). A lambda phage clone designated λGΦ5, containing ˜13,500 bases upstream from the hTERT encoding sequence, is available from the ATCC under Accession No. 98505. Example 1 illustrates the testing and use of TERT promoter sequences in vector expression systems. Those skilled in the art will appreciate that promoter sequences not contained in λGΦ5 but homologous and capable of promoting preferential expression in cancer cells can be used with similar effect. For example, a TERT promoter can comprise a sequence of 25, 50, 100, or 200 consecutive nucleotides that is 80%, 90%, or 100% identical (or can hybridize under stringent conditions) to a sequence contained in SEQ. ID NO:1.
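The identity criterion at the end of the preceding paragraph can be checked computationally. The sketch below scans every window of a given length in a candidate sequence against every window of the same length in a reference sequence and reports the best percent identity. It is a simplified illustration: it compares windows without gaps, the function name and toy sequences are hypothetical, and the brute-force scan would be slow on a reference of many thousands of bases. It does not model hybridization under stringent conditions.

```python
def best_window_identity(candidate: str, reference: str, window: int) -> float:
    """Best ungapped percent identity over all window-length stretches.

    Compares each `window`-nucleotide stretch of `candidate` with each
    stretch of the same length in `reference` and returns the highest
    percentage of matching positions found.
    """
    candidate, reference = candidate.upper(), reference.upper()
    if window <= 0 or window > min(len(candidate), len(reference)):
        raise ValueError("window must fit inside both sequences")
    best = 0.0
    for i in range(len(candidate) - window + 1):
        cand_win = candidate[i:i + window]
        for j in range(len(reference) - window + 1):
            ref_win = reference[j:j + window]
            matches = sum(a == b for a, b in zip(cand_win, ref_win))
            best = max(best, 100.0 * matches / window)
    return best


if __name__ == "__main__":
    # Toy sequences, invented for illustration; not taken from SEQ. ID NO:1.
    reference = "TTACGTACCTAATT"
    candidate = "ACGTACGTAA"
    identity = best_window_identity(candidate, reference, window=10)
    print(f"best 10-nt window identity: {identity:.0f}%")
    print("meets 80% criterion:", identity >= 80.0)
```

For a real comparison, the window would be set to 25, 50, 100, or 200 nucleotides and the threshold to 80%, 90%, or 100%, as recited above; an established alignment tool would be preferable where gaps must be considered.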

[0063] As an alternative, a transcriptional control element can be used that is tissue-specific. Constructs of this kind will cause preferential expression of the glycosyltransferase, if the level of expression of the endogenous gene is higher in tumor cells than in non-malignant tissue of the same type. They are also useful to treat tumors that have metastasized away from the primary site. Examples are promoters that control transcription of albumin (liver-specific), α-fetoprotein (AFP, liver-specific), prostate-specific antigen (PSA, prostate-specific), mitochondrial creatine kinase (MCK, muscle-specific), myelin basic protein (MBP, oligodendrocyte-specific), glial fibrillary acidic protein (GFAP, glial cell specific), and neuron-specific enolase (NSE, neuron-specific). See U.S. Pat. No. 5,871,726 (Calydon), WO 98/39466 (Calydon), U.S. Pat. No. 5,998,205 (Genetic Therapy Inc.).

[0064] Additional promoters suitable for use in this invention can be taken from other genes that are preferentially expressed in tumor cells. Such genes can be identified, for example, by differential display and comparative genomic hybridization: see U.S. Pat. Nos. 5,759,776 and 5,776,683. Alternatively, microarray analysis can be performed, in which cDNA fragments of candidate genes are prepared in a 96 or 384 well format, and then spotted directly onto glass slides. To compare mRNA preparations from cancer cells and a matched non-malignant control, one preparation is converted into Cy3-labeled cDNA, while the other is converted into Cy5-labeled cDNA. The two cDNA preparations are hybridized simultaneously to the microarray slide, and then washed to eliminate non-specific binding. Any given spot on the array will bind each of the cDNA products in proportion to abundance of the transcript in the two original mRNA preparations. The slide is then scanned at wavelengths appropriate for each of the labels, and the relative abundance of mRNA is determined. Preferably, the level of expression of the effector gene will be at least 5-fold or even 25-fold higher in the undifferentiated cells relative to the differentiated cells. Having identified transcriptional control elements of interest, specificity can be tested in a reporter construct where the control element is used to control transcription of a reporter gene, such as green fluorescent protein, secreted alkaline phosphatase, or β-galactosidase.
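The two-color comparison described above reduces to a per-spot intensity ratio followed by a threshold. The following sketch shows that arithmetic under stated assumptions: the cancer preparation is taken to be the Cy5 channel and the control the Cy3 channel, intensities are assumed to be already background-corrected and normalized between channels, and the gene names, intensities, and minimum-signal cutoff are hypothetical.

```python
def fold_changes(cy3: dict, cy5: dict, min_signal: float = 1.0) -> dict:
    """Cy5/Cy3 intensity ratio for each spot present in both channels.

    Spots whose intensity in either channel falls below `min_signal` are
    skipped, because ratios built on near-zero signal are unreliable.
    """
    ratios = {}
    for gene in cy3.keys() & cy5.keys():
        if cy3[gene] < min_signal or cy5[gene] < min_signal:
            continue
        ratios[gene] = cy5[gene] / cy3[gene]
    return ratios


def preferentially_expressed(ratios: dict, threshold: float = 5.0) -> list:
    """Genes whose ratio meets or exceeds the fold-change threshold."""
    return sorted(gene for gene, ratio in ratios.items() if ratio >= threshold)


if __name__ == "__main__":
    # Invented intensities for three hypothetical spots.
    control_cy3 = {"gene_1": 100.0, "gene_2": 200.0, "gene_3": 0.5}
    cancer_cy5 = {"gene_1": 3000.0, "gene_2": 400.0, "gene_3": 50.0}
    ratios = fold_changes(control_cy3, cancer_cy5)
    print("at least 5-fold: ", preferentially_expressed(ratios, 5.0))
    print("at least 25-fold:", preferentially_expressed(ratios, 25.0))
```

Genes passing the chosen threshold would then be candidates for the reporter-construct testing described above.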

[0065] Formulation and Administration of Cancer Therapeutics

[0066] A number of viral vectors are suitable for cancer gene therapy according to the invention. For general principles in vector construction, the reader is referred to Viral Vectors for Gene Therapy (B. J. Carter, Biotechnology 1999, XVIII, 562 p. 393,1999).

[0067] Adenovirus vectors provide transient gene expression, and can be constructed to be replication competent or replication incompetent. For general principles in adenovirus construction, see Danthinne et al., Gene Ther. 7:1707, 2000; Bilbao et al., Adv. Exp. Med. Biol. 451:365, 1998; and U.S. Pat. No. 5,631,236 (Baylor College of Medicine), U.S. Pat. No. 5,670,488 (Genzyme), U.S. Pat. No. 5,698,443 (Calydon), U.S. Pat. No. 5,712,136 (GenVec), U.S. Pat. No. 5,880,102 (Duke University), U.S. Pat. No. 5,994,128 (IntroGene), U.S. Pat. No. 6,040,174 (Transgene), U.S. Pat. No. 6,096,718 (Gene Targeting Corp).

[0068] Retrovirus vectors can be constructed to provide gene expression that is inheritable by progeny of the cell it infects. U.S. Pat. Nos. 5,698,446 and 6,133,029 (Chiron). Vectors can also be based on viruses of the herpes family. U.S. Pat. No. 5,728,379 (Georgetown University). Adeno-associated virus, reovirus, and a number of other viruses are also suitable.

[0069] As an alternative, the vectors of this invention can be constructed on a technology which is not virus based. Suitable are nucleic acid-lipid complexes of various kinds, where the lipid protects the nucleic acid en route to the tumor, and facilitates entry into the cell. One form is cationic liposomes or micelles. Li et al. (Gene Ther. 5:930, 1998) generally describe cationic lipid-protamine-DNA complexes for intravenous gene delivery. Another form is neutral or anionic liposomes, where the DNA is encapsulated in a lipid envelope that may express other components to inhibit non-specific uptake. See U.S. Pat. No. 5,981,501 (Inex) and U.S. Pat. No. 6,043,094 (Sequus/Alza). The composition may resemble an artificial viral envelope. See U.S. Pat. No. 5,766,625 (U. Florida) and WO 97/04748 (Advanced Therapies).

[0070] Also part of the invention are viral constructs in which gene expression is cell-specific, and the virus itself is replication conditional. See generally Todo et al., Cancer Gene Ther. 7:939, 2000; and WO 00/46355 (Geron). In this embodiment, the glycosyltransferase encoding region is under control of a tissue or tumor specific control element—and a gene essential for replication or packaging of the virus is also under control of a tissue or tumor-specific control element. Genes required for replication of adenovirus include E1a, E1b, E2, and E4. Genes required for replication of HSV include ICP6 and ICP4. Glycosyltransferase expression and viral replication can be controlled by the same promoter—or they can be controlled by different promoters, providing a further level of specificity for cancer cells.

[0071] Constructs comprising different glycosyltransferase encoding regions and different regulatory control elements can be tested and compared in several different assay systems. Suitable cells for these assays include human tumor cells expressing the gene from which the regulatory control element of the virus is taken (e.g., hTERT), matched with cell lines from a similar non-malignant tissue, or a tissue expressing about the same density of acceptor substrate for the glycosyltransferase. The cells can be transduced with the test vector, with a vector not comprising the glycosyltransferase sequence (negative control), and with a vector in which the glycosyltransferase is under control of a constitutive promoter (such as CMV or PGK).

[0072] Expression of the glycosyltransferase can be detected at the RNA level by RT-PCR, and at the protein level by immunocytochemistry, according to standard techniques. Expression of the cell-surface determinant synthesized by the glycosyltransferase can be detected using epitope-specific antibody or lectin, for example, by FACS. Human type B serum contains antibodies to A substance and to the Galα(1,3)Gal xenoantigen. The “IB4” lectin from Bandeiraea (Griffonia) simplicifolia (Sigma Cat. L 3019) is specific for α-D-galactosyl residues and binds both the Galα(1,3)Gal epitope, and B blood group substance. Antigen density can be compared for vectors with different promoters and effectors in quantitative assays using labeled monovalent antibody. Monogamous bivalency (the ability or inability of specific IgG to bind by more than one combining site) can be measured in suspended cells using the antiglobulin test (Romans et al., J. Immunol. 124:2807, 1980).

[0073] Ultimately, efficacy of the constructs of this invention can be assessed by their ability to trigger complement-mediated tumor cell lysis. A panel of tumor and non-tumor lines in culture is transfected with the vector, and then exposed to a source of epitope-specific antibody plus complement. For typical vectors encoding α1,3GT, fresh human serum will contain sufficient antibody and complement to cause specific lysis. For typical vectors encoding an A or B transferase, fresh serum of O blood type should cause lysis. If fresh serum is not available for the product of a particular glycosyltransferase, lysis can be measured using specific antibody and guinea pig complement. Rather than measuring lysis, the cells can be treated for a brief interval and then injected into a suitable mouse model, to determine if the treatment inhibits tumor growth.

[0074] General validation of the approach and titration of virus can be performed using an α1,3GT vector in α1,3GT knockout mice. U.S. Pat. No. 5,849,991 (Bresatec) reports mice that are homozygous for inactivated α1,3GT, resulting in lack of expression of Galα(1,3)Gal epitope, as determined by specific antibody. A model is developed in which the mice are injected with a representative human cancer cell line, such as a glioma. After solid tumors of a sizeable diameter have developed, the mice are injected intravenously or intratumorally with the α1,3GT vector. A dose of 10⁵ to 10⁸ pfu is the predicted test range for HSV vectors. Once the α1,3GT is expressed, anti-Galα(1,3)Gal in the plasma of these mice should opsonize the tumor cells, slowing tumor growth, potentially causing regression and increased survival.
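The predicted test range of 10⁵ to 10⁸ pfu spans three orders of magnitude, so a titration would ordinarily step through it on a logarithmic scale. The following is a minimal sketch of such a dose series. The function name and the choice of one dose per decade are illustrative assumptions, not part of the disclosed protocol.

```python
def log_dose_series(low_exp, high_exp, steps_per_decade=1):
    """Return doses (pfu) evenly spaced on a log10 scale, both ends included.

    low_exp and high_exp are the base-10 exponents of the range
    (5 and 8 for the 10^5 to 10^8 pfu range given in the text).
    steps_per_decade is a hypothetical design choice.
    """
    if high_exp < low_exp or steps_per_decade < 1:
        raise ValueError("need high_exp >= low_exp and steps_per_decade >= 1")
    n_steps = (high_exp - low_exp) * steps_per_decade
    return [10 ** (low_exp + i / steps_per_decade) for i in range(n_steps + 1)]


# One dose per decade across the predicted HSV test range.
doses = log_dose_series(5, 8)
for dose in doses:
    print(f"{dose:.1e} pfu")
```

Raising `steps_per_decade` to 2 would add half-log doses between each decade, which may be useful once a coarse series has located the active range.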

[0075] Treatment of human patients having a tumor depends on the nature of the vectors available and the carbohydrate determinants naturally expressed on their cells. Patients of blood type O (~46% of the U.S. population) will have natural antibody to both A and B substance, and can be treated with a vector encoding either A or B transferase. Patients of blood type A (~38%) or B (~12%) will have natural antibody to the opposite determinant, and can be treated with a vector encoding the transferase that makes that opposite determinant (B transferase for type A patients, A transferase for type B patients). Patients of blood type AB (~4% of the population) will not be treatable using either vector. It is possible to use a mixture of A and B transferase vectors as a universal reagent for patients of blood types A, B, and O (~96% of the population). The lytic potential of the mixture may be somewhat reduced in blood types A and B, since the transferases will be codominantly expressed.
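The matching of patient blood type to transferase vector described above can be written out as a small lookup. The sketch below uses only the antibody specificities and approximate U.S. frequencies stated in the preceding paragraph. The function and variable names are illustrative assumptions.

```python
# Natural antibody specificities for each ABO blood type, as stated in the text.
NATURAL_ANTIBODIES = {
    "O": {"A", "B"},
    "A": {"B"},
    "B": {"A"},
    "AB": set(),
}

# Approximate U.S. population frequencies (percent) quoted in the text.
US_FREQUENCY = {"O": 46, "A": 38, "B": 12, "AB": 4}


def usable_vectors(blood_type):
    """Transferase vectors whose product the patient has natural antibody against."""
    return sorted(NATURAL_ANTIBODIES[blood_type])


def coverage(vectors):
    """Percent of the population with antibody to at least one encoded determinant."""
    vectors = set(vectors)
    return sum(
        freq
        for blood_type, freq in US_FREQUENCY.items()
        if NATURAL_ANTIBODIES[blood_type] & vectors
    )


for blood_type in ("O", "A", "B", "AB"):
    print(blood_type, usable_vectors(blood_type))
print("A+B mixture coverage:", coverage({"A", "B"}), "%")
```

With these figures the A and B mixture covers 96% of the population, which agrees with the value given in the text. The sketch does not model the reduced lytic potential of the mixture in type A and B patients.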

[0076] A universal reagent suitable for treating all ABO blood groups is a vector made using α1,3GT. Since humans do not have the Galα(1,3)Gal epitope, essentially everyone should have naturally occurring antibody against it. α1,3GT must compete in humans for the N-acetyl lactosamine acceptor substrate with the α(1,2)fucosyltransferase that makes H substance. Since α1,3GT fares less well in this competition because of its position in the Golgi (Osman et al., J. Biol. Chem. 271:33105, 1996), a higher density of epitope will be formed by a construct that encodes the N-terminal membrane anchoring domain of the α(1,2)fucosyltransferase fused to the extramembrane catalytic domain of α1,3GT.

[0077] Dosage and formulation of medicaments intended for human therapy are designed based on the animal model experiments. For general guidance on formulation and testing of medicament formulations for human administration, the reader is referred to Biopharmaceutical Drug Design and Development (S. Wu-Pong et al. eds, Humana Press 1999); Biopharmaceuticals: Biochemistry and Biotechnology (G. Walsh, John Wiley & Sons, 1998); and the most current edition of Remington: The Science and Practice of Pharmacy (A. Gennaro, Lippincott, Williams & Wilkins). Pharmaceutical compositions of this invention may be packaged in a container with written instructions for use of the composition in human therapy, and the treatment of cancer.

[0078] The examples that follow are provided by way of further illustration, and are not meant to limit the claimed invention.

EXAMPLES

Example 1 Preparation of Vectors Controlling Transcription in Cells Expressing Telomerase Reverse Transcriptase

[0079] The lambda clone designated λGΦ5 containing the hTERT promoter is deposited with the American Type Culture Collection (ATCC), 10801 University Blvd., Manassas, Va. 20110 U.S.A., under Accession No. 98505. λGΦ5 contains a 15.3 kbp insert including approximately 13,500 bases upstream from the hTERT coding sequence.

[0080] A NotI fragment containing the hTERT promoter sequences was subcloned into the NotI site of a pUC-derived plasmid, which was designated pGRN142. A subclone (plasmid pGRN140) containing a 9 kb NcoI fragment (with hTERT gene sequence and about 4 to 5 kb of lambda vector sequence) was partially sequenced to determine the orientation of the insert. pGRN140 was digested using SalI to remove lambda vector sequences, and the resulting plasmid (with removed lambda sequences) was designated pGRN144. The pGRN144 insert was then sequenced.

[0081] SEQ. ID NO:1 is a listing of the sequence data obtained. Nucleotides 1-43 and 15376-15418 are plasmid sequence. Thus, the genomic insert begins at residue 44 and ends at residue 15375. The beginning of the cloned cDNA fragment corresponds to residue 13490. There are Alu sequence elements located ˜1700 base pairs upstream. The sequence of the hTERT insert of pGRN142 can now be obtained from GenBank (http://www.ncbi.nlm.nih.gov/) under Accession PGRN142.INS AF121948. Numbering of hTERT residues for plasmids in the following description begins from the translation initiation codon, according to standard practice in the field. The hTERT ATG codon (the translation initiation site) begins at residue 13545 of SEQ. ID NO:1. Thus, position −1, the first upstream residue, corresponds to nucleotide 13544 in SEQ. ID NO:1.
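The relationship between ATG-relative numbering and residue numbers in SEQ. ID NO:1 can be sketched as a pair of conversion functions. The constants come from the paragraph above. The function names are illustrative, and the sketch assumes that position +1 is the A of the ATG codon at residue 13545 and that there is no position 0, which is the usual convention but is only implied by the text.

```python
ATG_INDEX = 13545      # residue of SEQ. ID NO:1 where the hTERT ATG begins (from the text)
INSERT_START = 44      # first residue of the genomic insert (from the text)
INSERT_END = 15375     # last residue of the genomic insert (from the text)


def to_seq_index(position):
    """Map an ATG-relative position to a residue number in SEQ. ID NO:1.

    Assumes +1 is the A of the ATG and -1 is the first upstream residue,
    with no position 0.
    """
    if position == 0:
        raise ValueError("there is no position 0 in this numbering")
    if position > 0:
        return ATG_INDEX + position - 1
    return ATG_INDEX + position


def to_relative(index):
    """Map a residue number in SEQ. ID NO:1 to an ATG-relative position."""
    if index >= ATG_INDEX:
        return index - ATG_INDEX + 1
    return index - ATG_INDEX


print(to_seq_index(-1))            # first upstream residue
print(to_seq_index(1))             # A of the ATG
print(to_relative(INSERT_START))   # most upstream position present in the insert
```

Under these assumptions, position −1 maps to nucleotide 13544, as stated in the text. The insert begins 13,501 residues upstream of the ATG, consistent with the approximately 13,500 upstream bases described for λGΦ5.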

[0082] Expression studies were conducted with reporter constructs comprising various hTERT upstream and intron sequences. A BglII-Eco47III fragment from pGRN144 (described above) was excised and cloned into the BglII-NruI sites of pSEAP2Basic (ClonTech, San Diego, Calif.) to produce the plasmid designated pGRN148. A second promoter-reporter plasmid, pGRN150, was made by inserting the BglII-FspI fragment from pGRN144 into the BglII-NruI sites of pSEAP2. Plasmid pGRN173 was constructed by using the EcoRV-StuI (from +445 to −2482) fragment from pGRN144. This makes a promoter-reporter plasmid that contains the promoter region of hTERT from approximately 2.5 kb upstream from the start of the hTERT open reading frame to just after the first intron within the coding region, with the initiating Met codon of the hTERT open reading frame changed to Leu. Plasmid pGRN175 was made by ApaI(Klenow blunt)-SrfI digestion and religation of pGRN150 to delete most of the genomic sequence upstream of hTERT. This makes a promoter/reporter plasmid that uses 82 nucleotides of hTERT upstream sequences (from position −36 to −117). Plasmid pGRN176 was made by PmlI-SrfI digestion and religation of pGRN150 to delete most of the hTERT upstream sequences. This makes a promoter/reporter plasmid that uses 204 nucleotides of hTERT upstream sequences (from position −36 to −239).

[0083] Levels of secreted placental alkaline phosphatase (SEAP) activity were detected using the chemiluminescent substrate CSPD™ (ClonTech). SEAP activity detected in the culture medium was found to be directly proportional to changes in intracellular concentrations of SEAP mRNA. The pGRN148 and pGRN150 plasmids (hTERT promoter-reporter) and the pSEAP2 plasmid (positive control, containing the SV40 early promoter and enhancer) were transfected into test cell lines. The pGRN148 and pGRN150 constructs drove SEAP expression as efficiently as pSEAP2 in immortal (tumor-derived) cell lines. Only the pSEAP2 control gave detectable activity in mortal cells.

[0084] The ability of the hTERT promoter to specifically drive the expression of the thymidine kinase (tk) gene in tumor cells was tested using a variety of constructs. One construct, designated pGRN266, contains an EcoRI-FseI PCR fragment with the tk gene cloned into the EcoRI-FseI sites of pGRN263. pGRN263, containing approximately 2.5 kb of hTERT promoter sequence, is similar to pGRN150, but contains a neomycin gene as selection marker. pGRN267 contains an EcoRI-FseI PCR fragment with the tk gene cloned into the EcoRI-FseI sites of pGRN264. pGRN264, containing approximately 210 bp of hTERT promoter sequence, is similar to pGRN176, but contains a neomycin gene as selection marker. pGRN268 contains an EcoRI-XbaI PCR fragment with the tk gene cloned into the EcoRI-XbaI (unmethylated) sites of pGRN265. pGRN265, containing approximately 90 bp of hTERT promoter sequence, is similar to pGRN175, but contains a neomycin gene as selection marker.

[0085] These hTERT promoter/tk constructs, pGRN266, pGRN267 and pGRN268, were re-introduced into mammalian cells and tk/+ stable clones (and/or mass populations) were selected. Ganciclovir treatment in vitro of the tk/+ cells resulted in selective destruction of all tumor lines tested, including 143B, 293, HT1080, BxPC-3, DAOY and NIH3T3. Ganciclovir treatment had no effect on normal BJ cells.

[0086] FIG. 1 is a map of the TPAC adenovector pGRN376. It was made by cloning the NotI-BamHI fragment from pGRN267 into the NotI-BglII sites of pAdBN (Quantum Biotech). The 7185 bp vector comprises the herpes simplex thymidine kinase (tk) gene under control of the medium-length hTERT promoter sequence.

Example 2 Killing Cancer Cells Using Vectors Controlled by the TERT Promoter

[0087] A replication-conditional adenovirus was constructed by placing a gene involved in viral replication under control of the hTERT promoter, which should activate transcription in telomerase-expressing cancer cells. The viral construct comprised the Inverted Terminal Repeat (ITR) from adenovirus Ad2; followed by the hTERT medium-length promoter (phTERT176) operably linked to the adenovirus E1a region; followed by the rest of the adenovirus deleted for the E3 region (ΔE3). As a positive control, a similar construct was made in which E1a was placed under control of the CMV promoter, which should activate transcription in any cell.

[0088] Reagents were obtained as follows. pBR322, restriction enzymes: NEB, Beverly, Mass. Adenovirus Type 2 (Ad2), tissue culture reagents: Gibco/BRL, Grand Island, N.Y. Profection Mammalian Transfection Systems: Promega, Madison, Wis. Tumor and Normal Cell lines: ATCC, Manassas, Va., except BJ line, which was obtained from J. Smith, U. of Texas Southwestern Medical Center.

[0089] Briefly, a pBR322-based plasmid was constructed which contains the Adenovirus Type 2 genome with deletions from 356-548 nt (E1a promoter region) and 27971-30937 nt (E3). A multiple cloning region was inserted at the point of deletion of the E1a promoter, and the hTERT promoter (−239 to −36 nt) or the CMV promoter (−524 to −9 nt) was subsequently cloned into it. Numbering of the CMV sequence is in accordance with Akrigg et al., Virus Res. 2:107, 1985. Numbering of the Ad2 sequence is in accordance with “DNA Tumor Viruses: Molecular Biology of Tumor Viruses”, J. Tooze ed., Cold Spring Harbor Laboratory, N.Y. These plasmid DNAs were digested with SnaBI to liberate ITRs, then phenol-chloroform extracted, precipitated and transfected into 293A cells for propagation of the virus. Several rounds of plaque purification were performed using A549 cells, and a final isolate was expanded on these same cells. Viruses were titered by plaque assay on 293A cells, and tested for the presence of 5′ WT Ad sequences by PCR. DNA was isolated from viruses by Hirt extraction.

[0090] FIG. 2 shows the effect of these viruses on normal and cancer-derived cell lines. Each cell line was plated and infected at an MOI=20, ~24 h post plating. The cells were then cultured over a period of 17-48 days, and fed every fourth day. The pictures shown in the Figure were taken 7 days after infection. The top row of each section shows the results of cells that were not virally infected (negative control). The middle row shows the results of cells infected with oncolytic adenovirus, in which replication gene E1a is operably linked to the hTERT promoter. The bottom row of each section shows the results of cells infected with adenovirus in which E1a is operably linked to the CMV promoter (positive control). Results are summarized in Table 1.

TABLE 1 Effect of Oncolytic Virus on Cancerous and Non-cancerous Cells

| Cell Line | Origin | Culture Conditions | Uninfected cell Lysis | Lysis by phTERT-E1ΔE3 | Lysis by pCMV-E1ΔE3 |
|---|---|---|---|---|---|
| BJ | foreskin fibroblast | 90% DMEM/M199 + 10% FBS | NO | NO | YES |
| IMR | lung fibroblast | 90% DMEM/M199 + 10% FBS | NO | NO | YES |
| WI-38 | lung fibroblast | 90% DMEM/M199 + 10% FBS + 5 μg/mL gentamicin | NO | NO | YES |
| A549 | lung carcinoma | 90% RPMI + 10% FBS | NO | YES | YES |
| AsPC-1 | adenocarcinoma, pancreas | 90% RPMI + 10% FBS | NO | YES | YES |
| BxPC-3 | adenocarcinoma, pancreas | 90% EMEM + 10% FBS | NO | YES | YES |
| DAOY | medulloblastoma | 90% EMEM + 10% FBS | NO | YES | YES |
| HeLa | cervical carcinoma | 90% EMEM + 10% FBS | NO | YES | YES |
| HT1080 | fibrosarcoma | 90% EMEM + 10% FBS | NO | YES | YES |
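Infecting at a fixed multiplicity of infection (MOI) means scaling the amount of virus to the number of cells plated. As a minimal sketch of that arithmetic, the function below computes the inoculum volume from the MOI, the cell count, and the stock titer. The cell number and titer in the example are hypothetical values chosen for illustration. Only the MOI of 20 comes from the text.

```python
def inoculum_volume_ml(moi, cells_per_well, titer_pfu_per_ml):
    """Volume of virus stock (mL) that delivers `moi` pfu per cell.

    pfu needed = moi * cells_per_well
    volume     = pfu needed / titer
    """
    if moi <= 0 or cells_per_well <= 0 or titer_pfu_per_ml <= 0:
        raise ValueError("all inputs must be positive")
    return moi * cells_per_well / titer_pfu_per_ml


# MOI of 20 as in the text; 1e5 cells and a 1e9 pfu/mL stock are hypothetical.
volume = inoculum_volume_ml(moi=20, cells_per_well=1e5, titer_pfu_per_ml=1e9)
print(f"{volume * 1000:.1f} microliters per well")
```

In practice the stock would usually be diluted so that the volume added per well is large enough to pipette accurately.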

[0091] All cell lines tested were efficiently lysed by AdCMV-E1dlE3 by day 17 post-infection. All tumor lines were lysed by AdphTERT-E1dlE3 over a similar but slightly delayed period, while normal lines showed no signs of cytopathic effect and remained healthy out to 6 weeks post-infection.

[0092] The results demonstrate that an oncolytic virus can be constructed by placing a genetic element essential for replication of the virus under control of an hTERT promoter. Replication and lysis occur in cancer cells, but not in differentiated non-malignant cells.
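The selectivity conclusion can be checked directly against the lysis results reported in Table 1. The sketch below transcribes the YES/NO entries of that table. The tumor/normal flag for each line is an interpretation of the Origin column (fibroblast lines treated as normal, carcinoma and sarcoma lines as tumor), and the function name is illustrative.

```python
# (cell line, is_tumor, lysed uninfected, lysed by phTERT-E1dE3, lysed by pCMV-E1dE3)
# YES/NO values transcribed from Table 1; is_tumor is read from the Origin column.
TABLE_1 = [
    ("BJ",     False, False, False, True),
    ("IMR",    False, False, False, True),
    ("WI-38",  False, False, False, True),
    ("A549",   True,  False, True,  True),
    ("AsPC-1", True,  False, True,  True),
    ("BxPC-3", True,  False, True,  True),
    ("DAOY",   True,  False, True,  True),
    ("HeLa",   True,  False, True,  True),
    ("HT1080", True,  False, True,  True),
]


def selectivity_report(rows):
    """Summarize how the hTERT-driven virus behaved on tumor versus normal lines."""
    tumor_rows = [row for row in rows if row[1]]
    normal_rows = [row for row in rows if not row[1]]
    return {
        "tumor_lysed": sum(1 for row in tumor_rows if row[3]),
        "tumor_total": len(tumor_rows),
        "normal_lysed": sum(1 for row in normal_rows if row[3]),
        "normal_total": len(normal_rows),
        "cmv_lysed_all": all(row[4] for row in rows),
    }


report = selectivity_report(TABLE_1)
print(report)
```

On the tabulated data, the hTERT-driven virus lysed six of six tumor lines and none of three normal lines, while the CMV-driven control lysed every line.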

Example 3 Killing Cancer Cells Using Glycosyltransferase Vectors and Natural Antibody

[0093] Adenovirus vectors comprising encoding sequences for glycosyltransferase under control of the TERT promoter are constructed by cloning the encoding sequence behind the hTERT promoter sequence of pGRN267, as described in Example 1.

[0094] SEQ. ID NO:2 and SEQ. ID NO:4 provide the encoding sequences for the A and B transferase, respectively.

[0095] FIG. 3 is a comparison of the known mammalian α1,3GT protein sequences, the ABO transferases, and the amino acid translation of the human α1,3GT pseudogene. Based on this comparison and a comparison of the gene sequences, a humanized version of the marmoset α1,3GT protein sequence has been devised (SEQ. ID NO:13). Another α1,3GT sequence has been devised in which the marmoset prototype has been adapted with substitutions in the extracellular domain to enhance activity, based on a consensus of other mammalian α1,3GT amino acid sequences (SEQ. ID NO:12).

[0096] FIG. 4 provides a listing of a humanized α1,3GT encoding sequence, adapting the marmoset nucleic acid sequence with conservative and silent substitutions in the human pseudogene (SEQ. ID NO:16).

[0097] A model adenovirus vector is made using the sheep α1,3GT encoding sequence shown in SEQ. ID NO:17. Briefly, an Ecl136II fragment from a plasmid comprising the cDNA coding sequence plus ~70 bp of untranslated upstream sequence is cloned into the EcoRI(Klenow blunted)-FseI(Klenow blunted) sites of pGRN267 such that the sheep α1,3GT gene is in the same orientation as the hTERT promoter. Then a NotI-BamHI fragment from the plasmid containing the transcription pause region, the hTERT promoter, the sheep α1,3GT gene sequence and the SV40 polyA signal is cloned into the NotI-BglII sites of pAdBN (Quantum), which is then made into an adenovirus vector according to the manufacturer's technology.

[0098] Ability of α1,3GT and ABO transferase vectors to promote tumor cell lysis is tested using a panel of established cell lines as in Example 2.

[0099] First, the ABO phenotype of each line is determined by incubating alternate wells with fresh human serum of the A and B blood type at 37° C. for 30-60 min, and measuring trypan blue exclusion.

[0100] Fresh cells are then transduced with the test vectors at a suitable MOI, and cultured in a serum-free medium. Vectors comprising the opposite ABO transferase or α1,3GT under control of the TERT promoter are used to treat the test well. The same transferase under control of the CMV promoter is a positive control. A promoterless vector, a vector comprising ABO matched transferase, and empty vector can all serve as negative controls.

[0101] After 2 or 7 days, the cells are washed, and overlaid with fresh ABO matched human serum. After incubation at 37° C. for 30-60 min, 0.4% trypan blue is added, and the percentage of lysed (blue staining) cells is determined.
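The trypan blue readout reduces to a simple percentage, which can then be compared between test and control wells. The sketch below shows that calculation. The cell counts are hypothetical, and the background-corrected "specific lysis" formula is a common convention added here as an assumption. The text itself specifies only the percentage of blue-staining cells.

```python
def percent_lysed(blue_cells, total_cells):
    """Percentage of cells that take up trypan blue (scored as lysed)."""
    if total_cells <= 0 or not 0 <= blue_cells <= total_cells:
        raise ValueError("need 0 <= blue_cells <= total_cells and total_cells > 0")
    return 100.0 * blue_cells / total_cells


def specific_lysis(test_pct, background_pct):
    """Background-corrected lysis (assumed convention, not stated in the text).

    Rescales the test value so that the negative-control well reads 0%
    and complete lysis reads 100%.
    """
    if not 0 <= background_pct < 100:
        raise ValueError("background_pct must be in [0, 100)")
    return 100.0 * (test_pct - background_pct) / (100.0 - background_pct)


# Hypothetical counts: a TERT-promoter test well and an empty-vector control well.
test_well = percent_lysed(blue_cells=150, total_cells=200)
control_well = percent_lysed(blue_cells=10, total_cells=200)
print(f"test {test_well:.1f}%, control {control_well:.1f}%")
print(f"specific lysis {specific_lysis(test_well, control_well):.1f}%")
```

Reporting both the raw percentage and the background-corrected value makes it easier to compare cell lines that differ in baseline viability.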

SEQUENCE DATA

[0102] TABLE 2 Sequences listed in this Disclosure

| SEQ. ID NO: | Designation | Reference |
|---|---|---|
| 1 | Lambda clone designated λGΦ5 (ATCC Accession No. 98505). Contains human Telomerase Reverse Transcriptase (hTERT) genomic insert (residues 44-15375). The ATG translation initiation site begins at residue 13545. | GenBank Accession AF121948. International Patent Publication WO 00/46355. |
| 2 | Human histo blood group A transferase cDNA sequence | GenBank Accession J05175. See also Accession Nos. AF134413 & AF134412; Yamamoto et al., Nature 345:229 (1990); U.S. Pat. No. 5,326,857 |
| 3 | Human histo blood group A transferase amino acid sequence | (supra) |
| 4 | Human histo blood group B transferase cDNA sequence | GenBank Accession AF134414. Yamamoto et al., Nature 345:229 (1990); U.S. Pat. No. 5,326,857 |
| 5 | Human histo blood group B transferase amino acid sequence | (supra) |
| 6 | Marmoset α1,3-galactosyltransferase amino acid sequence | GenBank Accession S71333. Henion et al., Glycobiology 4, 193 (1994) |
| 7 | Amino acid translation of human α1,3-galactosyltransferase pseudogene | (infra) |
| 8 | Sheep α1,3-galactosyltransferase amino acid sequence | Chris Denning & John Clark, Geron Biomed |
| 9 | Bovine α1,3-galactosyltransferase amino acid sequence | GenBank Accession J04989. Joziasse et al. “Bovine α1->3-galactosyltransferase” J. Biol. Chem. 264, 14290 (1989) |
| 10 | Pig α1,3-galactosyltransferase amino acid sequence | GenBank Accession L36152, Sus scrofa alpha-1,3-galactosyltransferase mRNA. Strahan et al. “cDNA sequence and chromosome localization of pig α1,3 galactosyltransferase” Immunogenetics 41, 101 (1995). See also GenBank Accession L36535; Sandrin et al. “Characterization of cDNA clones for porcine α(1,3)galactosyl transferase” Xenotransplantation (1994) |
| 11 | Mouse α1,3-galactosyltransferase amino acid sequence | GenBank Accession M26925. Larsen et al. “Isolation of a cDNA encoding a murine UDP galactose: β-D-galactosyl-1,4-N-acetyl-D-glucosaminide alpha-1,3-galactosyltransferase” Proc. Natl. Acad. Sci. USA 86, 8227 (1989). See also GenBank Accession M85153; Joziasse et al. “Murine alpha-1,3-galactosyltransferase: A single gene locus specifies four isoforms of the enzyme by alternative splicing” J. Biol. Chem. 267, 5534 (1992) |
| 12 | Consensus α1,3-galactosyltransferase amino acid sequence | This Invention |
| 13 | Humanized α1,3-galactosyltransferase amino acid sequence | This Invention |
| 14 | Marmoset α1,3-galactosyltransferase cDNA sequence | GenBank Accession S71333. Henion et al., Glycobiology 4, 193 (1994) |
| 15 | Human α1,3-galactosyltransferase pseudogene sequence | GenBank Accession J05421. Larsen et al., J. Biol. Chem. 265:7055, 1990. See also GenBank Accession M60263; Joziasse et al. “Characterization of an alpha-1->3-galactosyltransferase homologue on human chromosome 12 that is organized as a processed pseudogene” J. Biol. Chem. 266, 6991 (1991) |
| 16 | Humanized α1,3-galactosyltransferase encoding sequence | This Invention |
| 17 | Sheep α1,3-galactosyltransferase encoding sequence | Chris Denning & John Clark, Geron Biomed |

[0103]

1 17 1 15418 DNA Homo sapiens 1 gcggccgcga gctctaatac gactcactat agggcgtcga ctcgatcaat ggaagatgag 60 gcattgccga agaaaagatt aatggatttg aacacacagc aacagaaact acatgaagtg 120 aaacacagga aaaaaaagat aaagaaacga aaagaaaagg gcatcagtga gcttcagcag 180 aagttccatc ggccttacat atgtgtaagc agaggccctg taggagcaga ggcaggggga 240 aaatacttta agaaataatg tctaaaagtt tttcaaatat gaggaaaaac ataaaaccac 300 agatccaaga agctcaacaa aacaaagcac aagaaacagg aagaaattaa aagttatatc 360 acagtcaaat tgctgaaaac cagcaacaaa gagaatatct taagagtatc agaggaaaag 420 agattaatga caggccaaga aacaatgaaa acaatacaga tttcttgtag gaaacacaag 480 acaaaagaca ttttttaaaa ccaaaaggaa aaaaaatgct acattaaaat gttttttacc 540 cactgaaagt atatttcaaa acatatttta ggccaggctt ggtggctcac acctgtaatc 600 ccagcacttt gggaggccaa ggtgggtgga tcgcttaagg tcaggagttc gagaccagcc 660 tggccaatat agcgaaaccc catctgtact aaaaacacaa aaattagctg ggtgtggtga 720 cacatgcctg taatcccagg tactcaggag gctaaggcag gagaattgct tgaactggga 780 ggcagaggtg gtgagccaag attgcaccag tgcactccag ccttggtgac agagtgaaac 840 tccatctcaa aaacaaacaa acaaaataca tatacataaa tatatatgca catatatata 900 catatataaa tatatataca catatataaa tctatataca tatatacata tatacacata 960 tataaatcta tatacatata tatacatata taatatattt acatatataa atatatacat 1020 atataaatat acatatataa atacatatat aaatatacat atataaatat acatatataa 1080 atatacatat ataaatatat acatatataa atatacatat ataaatatat atacatatat 1140 aaatatataa atatacaagt atatacaaat atatacatat ataaatgtat atacgtatat 1200 acatatatat ataaatatat aaaaaaactt ttggctgggc acctttccaa atctcatggc 1260 acatataagt ctcatggtaa cctcaaataa aaaaacatat aacagataca ccaaaaataa 1320 aaaccaataa attaaatcat gccaccagaa gaaattacct tcactaaaag gaacacagga 1380 aggaaagaaa gaaggaagag aagaccatga aacaaccaga aaacaaacaa caaaacagca 1440 ggagtaattc ctgacttatc aataataatg ctgggtgtaa atggactaaa ctctccaatc 1500 aaaagacata gagtggctga atggacgaaa aaaacaagac tcaataatct gttgcctaca 1560 agaatatact tcacctataa agggacacat agactgaaaa taaaaggaag gaaaaatatt 1620 ctatgcaaat ggaaaccaaa aaaagaacag aactagctac acttatatca gacaaaatag 
1680 atttcaagac aaaaagtaca aaaagagaca aagtaattat ataataataa agcaaaaaga 1740 tataacaatt gtgaatttat atgcgcccaa cactgggaca cccagatata tacagcaaat 1800 attattagaa ctaaggagag agagagatcc ccatacaata atagctggag acttcacccc 1860 gcttttagca ttggacagat catccagaca gaaaatcaac caaaaaattg gacttaatct 1920 ataatataga acaaatgtac ctaattgatg tttacaagac atttcatcca gtagttgcag 1980 aatatgcatt ttttcctcag catatggatc attctcaagg atagaccata tattaggcca 2040 cagaacaagc cattaaaaat tcaaaaaaat tgagccaggc atgatggctt atgcttgtaa 2100 ttacagcact ttggggaggg tgaggtggga ggatgtcttg agtacaggag tttgagacca 2160 gcctgggcaa aatagtgaga ccctgtctct acaaactttt ttttttaatt agccaggcat 2220 agtggtgtgt gcctgtagtc ccagctactt aggaggctga agtgggagga tcacttgagc 2280 ccaagagttc aaggctacgg tgagccatga ttgcaacacc acacaccagc cttggtgaca 2340 gaatgagacc ctgtctcaaa aaaaaaaaaa aaaattgaaa taatataaag catcttctct 2400 ggccacagtg gaacaaaacc agaaatcaac aacaagagga attttgaaaa ctatacaaac 2460 acatgaaaat taaacaatat acttctgaat aaccagtgag tcaatgaaga aattaaaaag 2520 gaaattgaaa aatttattta agcaaatgat aacggaaaca taacctctca aaacccacgg 2580 tatacagcaa aagcagtgct aagaaggaag tttatagcta taagcagcta catcaaaaaa 2640 gtagaaaagc caggcgcagt ggctcatgcc tgtaatccca gcactttggg aggccaaggc 2700 gggcagatcg cctgaggtca ggagttcgag accagcctga ccaacacaga gaaaccttgt 2760 cgctactaaa aatacaaaat tagctgggca tggtggcaca tgcctgtaat cccagctact 2820 cgggaggctg aggcaggata accgcttgaa cccaggaggt ggaggttgcg gtgagccggg 2880 attgcgccat tggactccag cctgggtaac aagagtgaaa ccctgtctca agaaaaaaaa 2940 aaaagtagaa aaacttaaaa atacaaccta atgatgcacc ttaaagaact agaaaagcaa 3000 gagcaaacta aacctaaaat tggtaaaaga aaagaaataa taaagatcag agcagaaata 3060 aatgaaactg aaagataaca atacaaaaga tcaacaaaat taaaagttgg ttttttgaaa 3120 agataaacaa aattgacaaa cctttgccca gactaagaaa aaaggaaaga agacctaaat 3180 aaataaagtc agagatgaaa aaagagacat tacaactgat accacagaaa ttcaaaggat 3240 cactagaggc tactatgagc aactgtacac taataaattg aaaaacctag aaaaaataga 3300 taaattccta gatgcataca acctaccaag attgaaccat gaagaaatcc aaagcccaaa 3360 
cagaccaata acaataatgg gattaaagcc ataataaaaa gtctcctagc aaagagaagc 3420 ccaggaccca atggcttccc tgctggattt taccaatcat ttaaagaaga atgaattcca 3480 atcctactca aactattctg aaaaatagag gaaagaatac ttccaaactc attctacatg 3540 gccagtatta ccctgattcc aaaaccagac aaaaacacat caaaaacaaa caaacaaaaa 3600 aacagaaaga aagaaaacta caggccaata tccctgatga atactgatac aaaaatcctc 3660 aacaaaacac tagcaaacca aattaaacaa caccttcgaa agatcattca ttgtgatcaa 3720 gtgggattta ttccagggat ggaaggatgg ttcaacatat gcaaatcaat caatgtgata 3780 catcatccca acaaaatgaa gtacaaaaac tatatgatta tttcacttta tgcagaaaaa 3840 gcatttgata aaattctgca cccttcatga taaaaaccct caaaaaacca ggtatacaag 3900 aaacatacag gccaggcaca gtggctcaca cctgcgatcc cagcactctg ggaggccaag 3960 gtgggatgat tgcttgggcc caggagtttg agactagcct gggcaacaaa atgagacctg 4020 gtctacaaaa aactttttta aaaaattagc caggcatgat ggcatatgcc tgtagtccca 4080 gctagtctgg aggctgaggt gggagaatca cttaagccta ggaggtcgag gctgcagtga 4140 gccatgaaca tgtcactgta ctccagccta gacaacagaa caagacccca ctgaataaga 4200 agaaggagaa ggagaaggga gaaaggaggg agaagggagg aggaggagaa ggaggaggtg 4260 gaggagaagt ggaaggggaa ggggaaggga aagaggaaga agaagaaaca tatttcaaca 4320 taataaaagc cctatatgac agaccgaggt agtattatga ggaaaaactg aaagcctttc 4380 ctctaagatc tggaaaatga caagggccca ctttcaccac tgtgattcaa catagtacta 4440 gaagtcctag ctagagcaat cagataagag aaagaaataa aaggcatcca aactggaaag 4500 gaagaagtca aattatcctg tttgcagatg atatgatctt atatctggaa aagacttaag 4560 acaccactaa aaaactatta gagctgaaat ttggtacagc aggatacaaa atcaatgtac 4620 aaaaatcagt agtatttcta tattccaaca gcaaacaatc tgaaaaagaa accaaaaaag 4680 cagctacaaa taaaattaaa cagctaggaa ttaaccaaag aagtgaaaga tctctacaat 4740 gaaaactata aaatattgat aaaagaaatt gaagagggca caaaaaaaga aaagatattc 4800 catgttcata gattggaaga ataaatactg ttaaaatgtc catactaccc aaagcaattt 4860 acaaattcaa tgcaatccct attaaaatac taatgacgtt cttcacagaa atagaagaaa 4920 caattctaag atttgtacag aaccacaaaa gacccagaat agccaaagct atcctgacca 4980 aaaagaacaa aactggaagc atcacattac ctgacttcaa attatactac aaagctatag 5040 taacccaaac 
tacatggtac tggcataaaa acagatgaga catggaccag aggaacagaa 5100 tagagaatcc agaaacaaat ccatgcatct acagtgaact catttttgac aaaggtgcca 5160 agaacatact ttggggaaaa gataatctct tcaataaatg gtgctggagg aactggatat 5220 ccatatgcaa aataacaata ctagaactct gtctctcacc atatacaaaa gcaaatcaaa 5280 atggatgaaa ggcttaaatc taaaacctca aactttgcaa ctactaaaag aaaacaccgg 5340 agaaactctc caggacattg gagtgggcaa agacttcttg agtaattccc tgcaggcaca 5400 ggcaaccaaa gcaaaaacag acaaatggga tcatatcaag ttaaaaagct tctgcccagc 5460 aaaggaaaca atcaacaaag agaagagaca acccacagaa tgggagaata tatttgcaaa 5520 ctattcatct aacaaggaat taataaccag tatatataag gagctcaaac tactctataa 5580 gaaaaacacc taataagctg attttcaaaa ataagcaaaa gatctgggta gacatttctc 5640 aaaataagtc atacaaatgg caaacaggca tctgaaaatg tgctcaacac cactgatcat 5700 cagagaaatg caaatcaaaa ctactatgag agatcatctc accccagtta aaatggcttt 5760 tattcaaaag acaggcaata acaaatgcca gtgaggatgt ggataaaagg aaacccttgg 5820 acactgttgg tgggaatgga aattgctacc actatggaga acagtttgaa agttcctcaa 5880 aaaactaaaa ataaagctac catacagcaa tcccattgct aggtatatac tccaaaaaag 5940 ggaatcagtg tatcaacaag ctatctccac tcccacattt actgcagcac tgttcatagc 6000 agccaaggtt tggaagcaac ctcagtgtcc atcaacagac gaatggaaaa agaaaatgtg 6060 gtgcacatac acaatggagt actacgcagc cataaaaaag aatgagatcc tgtcagttgc 6120 aacagcatgg ggggcactgg tcagtatgtt aagtgaaata agccaggcac agaaagacaa 6180 acttttcatg ttctccctta cttgtgggag caaaaattaa aacaattgac atagaaatag 6240 aggagaatgg tggttctaga ggggtggggg acagggtgac tagagtcaac aataatttat 6300 tgtatgtttt aaaataacta aaagagtata attgggttgt ttgtaacaca aagaaaggat 6360 aaatgcttga aggtgacaga taccccattt accctgatgt gattattaca cattgtatgc 6420 ctgtatcaaa atatctcatg tatgctatag atataaaccc tactatatta aaaattaaaa 6480 ttttaatggc caggcacggt ggctcatgtc cataatccca gcactttggg aggccgaggc 6540 ggtggatcac ctgaggtcag gagtttgaaa ccagtctggc caccatgatg aaaccctgtc 6600 tctactaaag atacaaaaat tagccaggcg tggtggcaca tacctgtagt cccaactact 6660 caggaggctg agacaggaga attgcttgaa cctgggaggc ggaggttgca gtgagccgag 6720 atcatgccac tgcactgcag 
cctgggtgac agagcaagac tccatctcaa aacaaaaaca 6780 aaaaaaagaa gattaaaatt gtaattttta tgtaccgtat aaatatatac tctactatat 6840 tagaagttaa aaattaaaac aattataaaa ggtaattaac cacttaatct aaaataagaa 6900 caatgtatgt ggggtttcta gcttctgaag aagtaaaagt tatggccacg atggcagaaa 6960 tgtgaggagg gaacagtgga agttactgtt gttagacgct catactctct gtaagtgact 7020 taattttaac caaagacagg ctgggagaag ttaaagaggc attctataag ccctaaaaca 7080 actgctaata atggtgaaag gtaatctcta ttaattacca ataattacag atatctctaa 7140 aatcgagctg cagaattggc acgtctgatc acaccgtcct ctcattcacg gtgctttttt 7200 tcttgtgtgc ttggagattt tcgattgtgt gttcgtgttt ggttaaactt aatctgtatg 7260 aatcctgaaa cgaaaaatgg tggtgatttc ctccagaaga attagagtac ctggcaggaa 7320 gcaggtggct ctgtggacct gagccacttc aatcttcaag ggtctctggc caagacccag 7380 gtgcaaggca gaggcctgat gacccgagga caggaaagct cggatgggaa ggggcgatga 7440 gaagcctgcc tcgttggtga gcagcgcatg aagtgccctt atttacgctt tgcaaagatt 7500 gctctggata ccatctggaa aaggcggcca gcgggaatgc aaggagtcag aagcctcctg 7560 ctcaaaccca ggccagcagc tatggcgccc acccgggcgt gtgccagagg gagaggagtc 7620 aaggcacctc gaagtatggc ttaaatcttt ttttcacctg aagcagtgac caaggtgtat 7680 tctgagggaa gcttgagtta ggtgccttct ttaaaacaga aagtcatgga agcacccttc 7740 tcaagggaaa accagacgcc cgctctgcgg tcatttacct ctttcctctc tccctctctt 7800 gccctcgcgg tttctgatcg ggacagagtg acccccgtgg agcttctccg agcccgtgct 7860 gaggaccctc ttgcaaaggg ctccacagac ccccgccctg gagagaggag tctgagcctg 7920 gcttaataac aaactgggat gtggctgggg gcggacagcg acggcgggat tcaaagactt 7980 aattccatga gtaaattcaa cctttccaca tccgaatgga tttggatttt atcttaatat 8040 tttcttaaat ttcatcaaat aacattcagg agtgcagaaa tccaaaggcg taaaacagga 8100 actgagctat gtttgccaag gtccaaggac ttaataacca tgttcagagg gatttttcgc 8160 cctaagtact ttttattggt tttcataagg tggcttaggg tgcaagggaa agtacacgag 8220 gagaggactg ggcggcaggg ctatgagcac ggcaaggcca ccggggagag agtccccggc 8280 ctgggaggct gacagcagga ccactgaccg tcctccctgg gagctgccac attgggcaac 8340 gcgaaggcgg ccacgctgcg tgtgactcag gaccccatac cggcttcctg ggcccaccca 8400 cactaaccca ggaagtcacg gagctctgaa 
cccgtggaaa cgaacatgac ccttgcctgc 8460 ctgcttccct gggtgggtca agggtaatga agtggtgtgc aggaaatggc catgtaaatt 8520 acacgactct gctgatgggg accgttcctt ccatcattat tcatcttcac ccccaaggac 8580 tgaatgattc cagcaacttc ttcgggtgtg acaagccatg acaacactca gtacaaacac 8640 cactctttta ctaggcccac agagcacggc ccacacccct gatatattaa gagtccagga 8700 gagatgaggc tgctttcagc caccaggctg gggtgacaac agcggctgaa cagtctgttc 8760 ctctagacta gtagaccctg gcaggcactc ccccagattc tagggcctgg ttgctgcttc 8820 ccgagggcgc catctgccct ggagactcag cctggggtgc cacactgagg ccagccctgt 8880 ctccacaccc tccgcctcca ggcctcagct tctccagcag cttcctaaac cctgggtggg 8940 ccgtgttcca gcgctactgt ctcacctgtc ccactgtgtc ttgtctcagc gacgtagctc 9000 gcacggttcc tcctcacatg gggtgtctgt ctccttcccc aacactcaca tgcgttgaag 9060 ggaggagatt ctgcgcctcc cagactggct cctctgagcc tgaacctggc tcgtggcccc 9120 cgatgcaggt tcctggcgtc cggctgcacg ctgacctcca tttccaggcg ctccccgtct 9180 cctgtcatct gccggggcct gccggtgtgt tcttctgttt ctgtgctcct ttccacgtcc 9240 agctgcgtgt gtctctgtcc gctagggtct cggggttttt ataggcatag gacgggggcg 9300 tggtgggcca gggcgctctt gggaaatgca acatttgggt gtgaaagtag gagtgcctgt 9360 cctcacctag gtccacgggc acaggcctgg ggatggagcc cccgccaggg acccgccctt 9420 ctctgcccag cacttttctg cccccctccc tctggaacac agagtggcag tttccacaag 9480 cactaagcat cctcttccca aaagacccag cattggcacc cctggacatt tgccccacag 9540 ccctgggaat tcacgtgact acgcacatca tgtacacact cccgtccacg accgaccccc 9600 gctgttttat tttaatagct acaaagcagg gaaatccctg ctaaaatgtc ctttaacaaa 9660 ctggttaaac aaacgggtcc atccgcacgg tggacagttc ctcacagtga agaggaacat 9720 gccgtttata aagcctgcag gcatctcaag ggaattacgc tgagtcaaaa ctgccacctc 9780 catgggatac gtacgcaaca tgctcaaaaa gaaagaattt caccccatgg caggggagtg 9840 gttggggggt taaggacggt gggggcagca gctgggggct actgcacgca ccttttacta 9900 aagccagttt cctggttctg atggtattgg ctcagttatg ggagactaac cataggggag 9960 tggggatggg ggaacccgga ggctgtgcca tctttgccat gcccgagtgt cctgggcagg 10020 ataatgctct agagatgccc acgtcctgat tcccccaaac ctgtggacag aacccgcccg 10080 gccccagggc ctttgcaggt gtgatctccg 
tgaggaccct gaggtctggg atccttcggg 10140 actacctgca ggcccgaaaa gtaatccagg ggttctggga agaggcgggc aggagggtca 10200 gaggggggca gcctcaggac gatggaggca gtcagtctga ggctgaaaag ggagggaggg 10260 cctcgagccc aggcctgcaa gcgcctccag aagctggaaa aagcggggaa gggaccctcc 10320 acggagcctg cagcaggaag gcacggctgg cccttagccc accagggccc atcgtggacc 10380 tccggcctcc gtgccatagg agggcactcg cgctgccctt ctagcatgaa gtgtgtgggg 10440 atttgcagaa gcaacaggaa acccatgcac tgtgaatcta ggattatttc aaaacaaagg 10500 tttacagaaa catccaagga cagggctgaa gtgcctccgg gcaagggcag ggcaggcacg 10560 agtgatttta tttagctatt ttattttatt tacttacttt ctgagacaga gttatgctct 10620 tgttgcccag gctggagtgc agcggcatga tcttggctca ctgcaacctc cgtctcctgg 10680 gttcaagcaa ttctcgtgcc tcagcctccc aagtagctgg gatttcaggc gtgcaccacc 10740 acacccggct aattttgtat ttttagtaga gatgggcttt caccatgttg gtcaggctga 10800 tctcaaaatc ctgacctcag gtgatccgcc cacctcagcc tcccaaagtg ctgggattac 10860 aggcatgagc cactgcacct ggcctattta accattttaa aacttccctg ggctcaagtc 10920 acacccactg gtaaggagtt catggagttc aatttcccct ttactcagga gttaccctcc 10980 tttgatattt tctgtaattc ttcgtagact ggggatacac cgtctcttga catattcaca 11040 gtttctgtga ccacctgtta tcccatggga cccactgcag gggcagctgg gaggctgcag 11100 gcttcaggtc ccagtggggt tgccatctgc cagtagaaac ctgatgtaga atcagggcgc 11160 gagtgtggac actgtcctga atctcaatgt ctcagtgtgt gctgaaacat gtagaaatta 11220 aagtccatcc ctcctactct actgggattg agccccttcc ctatcccccc ccaggggcag 11280 aggagttcct ctcactcctg tggaggaagg aatgatactt tgttattttt cactgctggt 11340 actgaatcca ctgtttcatt tgttggtttg tttgttttgt tttgagaggc ggtttcactc 11400 ttgttgctca ggctggaggg agtgcaatgg cgcgatcttg gcttactgca gcctctgcct 11460 cccaggttca agtgattctc ctgcttccgc ctcccatttg gctgggatta caggcacccg 11520 ccaccatgcc cagctaattt tttgtatttt tagtagagac gggggtgggg gtggggttca 11580 ccatgttggc caggctggtc tcgaacttct gacctcagat gatccacctg cctctgcctc 11640 ctaaagtgct gggattacag gtgtgagcca ccatgcccag ctcagaattt actctgttta 11700 gaaacatctg ggtctgaggt aggaagctca ccccactcaa gtgttgtggt gttttaagcc 11760 aatgatagaa 
tttttttatt gttgttagaa cactcttgat gttttacact gtgatgacta 11820 agacatcatc agcttttcaa agacacacta actgcaccca taatactggg gtgtcttctg 11880 ggtatcagcg atcttcattg aatgccggga ggcgtttcct cgccatgcac atggtgttaa 11940 ttactccagc ataatcttct gcttccattt cttctcttcc ctcttttaaa attgtgtttt 12000 ctatgttggc ttctctgcag agaaccagtg taagctacaa cttaactttt gttggaacaa 12060 attttccaaa ccgccccttt gccctagtgg cagagacaat tcacaaacac agccctttaa 12120 aaaggcttag ggatcactaa ggggatttct agaagagcga cccgtaatcc taagtattta 12180 caagacgagg ctaacctcca gcgagcgtga cagcccaggg agggtgcgag gcctgttcaa 12240 atgctagctc cataaataaa gcaatttcct ccggcagttt ctgaaagtag gaaaggttac 12300 atttaaggtt gcgtttgtta gcatttcagt gtttgccgac ctcagctaca gcatccctgc 12360 aaggcctcgg gagacccaga agtttctcgc cccttagatc caaacttgag caacccggag 12420 tctggattcc tgggaagtcc tcagctgtcc tgcggttgtg ccggggcccc aggtctggag 12480 gggaccagtg gccgtgtggc ttctactgct gggctggaag tcgggcctcc tagctctgca 12540 gtccgaggct tggagccagg tgcctggacc ccgaggctgc cctccaccct gtgcgggcgg 12600 gatgtgacca gatgttggcc tcatctgcca gacagagtgc cggggcccag ggtcaaggcc 12660 gttgtggctg gtgtgaggcg cccggtgcgc ggccagcagg agcgcctggc tccatttccc 12720 accctttctc gacgggaccg ccccggtggg tgattaacag atttggggtg gtttgctcat 12780 ggtggggacc cctcgccgcc tgagaacctg caaagagaaa tgacgggcct gtgtcaagga 12840 gcccaagtcg cggggaagtg ttgcagggag gcactccggg aggtcccgcg tgcccgtcca 12900 gggagcaatg cgtcctcggg ttcgtcccca gccgcgtcta cgcgcctccg tcctcccctt 12960 cacgtccggc attcgtggtg cccggagccc gacgccccgc gtccggacct ggaggcagcc 13020 ctgggtctcc ggatcaggcc agcggccaaa gggtcgccgc acgcacctgt tcccagggcc 13080 tccacatcat ggcccctccc tcgggttacc ccacagccta ggccgattcg acctctctcc 13140 gctggggccc tcgctggcgt ccctgcaccc tgggagcgcg agcggcgcgc gggcggggaa 13200 gcgcggccca gacccccggg tccgcccgga gcagctgcgc tgtcggggcc aggccgggct 13260 cccagtggat tcgcgggcac agacgcccag gaccgcgctt cccacgtggc ggagggactg 13320 gggacccggg cacccgtcct gccccttcac cttccagctc cgcctcctcc gcgcggaccc 13380 cgccccgtcc cgacccctcc cgggtccccg gcccagcccc ctccgggccc tcccagcccc 
13440 tccccttcct ttccgcggcc ccgccctctc ctcgcggcgc gagtttcagg cagcgctgcg 13500 tcctgctgcg cacgtgggaa gccctggccc cggccacccc cgcgatgccg cgcgctcccc 13560 gctgccgagc cgtgcgctcc ctgctgcgca gccactaccg cgaggtgctg ccgctggcca 13620 cgttcgtgcg gcgcctgggg ccccagggct ggcggctggt gcagcgcggg gacccggcgg 13680 ctttccgcgc gctggtggcc cagtgcctgg tgtgcgtgcc ctgggacgca cggccgcccc 13740 ccgccgcccc ctccttccgc caggtgggcc tccccggggt cggcgtccgg ctggggttga 13800 gggcggccgg ggggaaccag cgacatgcgg agagcagcgc aggcgactca gggcgcttcc 13860 cccgcaggtg tcctgcctga aggagctggt ggcccgagtg ctgcagaggc tgtgcgagcg 13920 cggcgcgaag aacgtgctgg ccttcggctt cgcgctgctg gacggggccc gcgggggccc 13980 ccccgaggcc ttcaccacca gcgtgcgcag ctacctgccc aacacggtga ccgacgcact 14040 gcgggggagc ggggcgtggg ggctgctgct gcgccgcgtg ggcgacgacg tgctggttca 14100 cctgctggca cgctgcgcgc tctttgtgct ggtggctccc agctgcgcct accaggtgtg 14160 cgggccgccg ctgtaccagc tcggcgctgc cactcaggcc cggcccccgc cacacgctag 14220 tggaccccga aggcgtctgg gatgcgaacg ggcctggaac catagcgtca gggaggccgg 14280 ggtccccctg ggcctgccag ccccgggtgc gaggaggcgc gggggcagtg ccagccgaag 14340 tctgccgttg cccaagaggc ccaggcgtgg cgctgcccct gagccggagc ggacgcccgt 14400 tgggcagggg tcctgggccc acccgggcag gacgcgtgga ccgagtgacc gtggtttctg 14460 tgtggtgtca cctgccagac ccgccgaaga agccacctct ttggagggtg cgctctctgg 14520 cacgcgccac tcccacccat ccgtgggccg ccagcaccac gcgggccccc catccacatc 14580 gcggccacca cgtccctggg acacgccttg tcccccggtg tacgccgaga ccaagcactt 14640 cctctactcc tcaggcgaca aggagcagct gcggccctcc ttcctactca gctctctgag 14700 gcccagcctg actggcgctc ggaggctcgt ggagaccatc tttctgggtt ccaggccctg 14760 gatgccaggg actccccgca ggttgccccg cctgccccag cgctactggc aaatgcggcc 14820 cctgtttctg gagctgcttg ggaaccacgc gcagtgcccc tacggggtgc tcctcaagac 14880 gcactgcccg ctgcgagctg cggtcacccc agcagccggt gtctgtgccc gggagaagcc 14940 ccagggctct gtggcggccc ccgaggagga ggacacagac ccccgtcgcc tggtgcagct 15000 gctccgccag cacagcagcc cctggcaggt gtacggcttc gtgcgggcct gcctgcgccg 15060 gctggtgccc ccaggcctct ggggctccag gcacaacgaa 
cgccgcttcc tcaggaacac 15120 caagaagttc atctccctgg ggaagcatgc caagctctcg ctgcaggagc tgacgtggaa 15180 gatgagcgtg cgggactgcg cttggctgcg caggagccca ggtgaggagg tggtggccgt 15240 cgagggccca ggccccagag ctgaatgcag taggggctca gaaaaggggg caggcagagc 15300 cctggtcctc ctgtctccat cgtcacgtgg gcacacgtgg cttttcgctc aggacgtcga 15360 gtggacacgg tgatcgagtc gactcccttt agtgagggtt aattgagctc gcggccgc 15418 2 1062 DNA Homo sapiens CDS (1)..(1062) 2 atg gcc gag gtg ttg cgg acg ctg gcc gga aaa cca aaa tgc cac gca 48 Met Ala Glu Val Leu Arg Thr Leu Ala Gly Lys Pro Lys Cys His Ala 1 5 10 15 ctt cga cct atg atc ctt ttc cta ata atg ctt gtc ttg gtc ttg ttt 96 Leu Arg Pro Met Ile Leu Phe Leu Ile Met Leu Val Leu Val Leu Phe 20 25 30 ggt tac ggg gtc cta agc ccc aga agt cta atg cca gga agc ctg gaa 144 Gly Tyr Gly Val Leu Ser Pro Arg Ser Leu Met Pro Gly Ser Leu Glu 35 40 45 cgg ggg ttc tgc atg gct gtt agg gaa cct gac cat ctg cag cgc gtc 192 Arg Gly Phe Cys Met Ala Val Arg Glu Pro Asp His Leu Gln Arg Val 50 55 60 tcg ttg cca agg atg gtc tac ccc cag cca aag gtg ctg aca ccg tgg 240 Ser Leu Pro Arg Met Val Tyr Pro Gln Pro Lys Val Leu Thr Pro Trp 65 70 75 80 aag gat gtc ctc gtg gtg acc cct tgg ctg gct ccc att gtc tgg gag 288 Lys Asp Val Leu Val Val Thr Pro Trp Leu Ala Pro Ile Val Trp Glu 85 90 95 ggc aca ttc aac atc gac atc ctc aac gag cag ttc agg ctc cag aac 336 Gly Thr Phe Asn Ile Asp Ile Leu Asn Glu Gln Phe Arg Leu Gln Asn 100 105 110 acc acc att ggg tta act gtg ttt gcc atc aag aaa tac gtg gct ttc 384 Thr Thr Ile Gly Leu Thr Val Phe Ala Ile Lys Lys Tyr Val Ala Phe 115 120 125 ctg aag ctg ttc ctg gag acg gcg gag aag cac ttc atg gtg ggc cac 432 Leu Lys Leu Phe Leu Glu Thr Ala Glu Lys His Phe Met Val Gly His 130 135 140 cgt gtc cac tac tat gtc ttc acc gac cag ctg gcc gcg gtg ccc cgc 480 Arg Val His Tyr Tyr Val Phe Thr Asp Gln Leu Ala Ala Val Pro Arg 145 150 155 160 gtg acg ctg ggg acc ggt cgg cag ctg tca gtg ctg gag gtg cgc gcc 528 Val Thr Leu Gly Thr Gly Arg Gln Leu Ser Val Leu Glu Val Arg Ala 165 170 175 
tac aag cgc tgg cag gac gtg tcc atg cgc cgc atg gag atg atc agt 576 Tyr Lys Arg Trp Gln Asp Val Ser Met Arg Arg Met Glu Met Ile Ser 180 185 190 gac ttc tgc gag cgg cgc ttc ctc agc gag gtg gat tac ctg gtg tgc 624 Asp Phe Cys Glu Arg Arg Phe Leu Ser Glu Val Asp Tyr Leu Val Cys 195 200 205 gtg gac gtg gac atg gag ttc cgc gac cac gtg ggc gtg gag atc ctg 672 Val Asp Val Asp Met Glu Phe Arg Asp His Val Gly Val Glu Ile Leu 210 215 220 act ccg ctg ttc ggc acc ctg cac ccc ggc ttc tac gga agc agc cgg 720 Thr Pro Leu Phe Gly Thr Leu His Pro Gly Phe Tyr Gly Ser Ser Arg 225 230 235 240 gag gcc ttc acc tac gag cgc cgg ccc cag tcc cag gcc tac atc ccc 768 Glu Ala Phe Thr Tyr Glu Arg Arg Pro Gln Ser Gln Ala Tyr Ile Pro 245 250 255 aag gac gag ggc gat ttc tac tac ctg ggg ggg ttc ttc ggg ggg tcg 816 Lys Asp Glu Gly Asp Phe Tyr Tyr Leu Gly Gly Phe Phe Gly Gly Ser 260 265 270 gtg caa gag gtg cag cgg ctc acc agg gcc tgc cac cag gcc atg atg 864 Val Gln Glu Val Gln Arg Leu Thr Arg Ala Cys His Gln Ala Met Met 275 280 285 gtc gac cag gcc aac ggc atc gag gcc gtg tgg cac gac gag agc cac 912 Val Asp Gln Ala Asn Gly Ile Glu Ala Val Trp His Asp Glu Ser His 290 295 300 ctg aac aag tac ctg ctg cgc cac aaa ccc acc aag gtg ctc tcc ccc 960 Leu Asn Lys Tyr Leu Leu Arg His Lys Pro Thr Lys Val Leu Ser Pro 305 310 315 320 gag tac ttg tgg gac cag cag ctg ctg ggc tgg ccc gcc gtc ctg agg 1008 Glu Tyr Leu Trp Asp Gln Gln Leu Leu Gly Trp Pro Ala Val Leu Arg 325 330 335 aag ctg agg ttc act gcg gtg ccc aag aac cac cag gcg gtc cgg aac 1056 Lys Leu Arg Phe Thr Ala Val Pro Lys Asn His Gln Ala Val Arg Asn 340 345 350 ccg tga 1062 Pro 3 353 PRT Homo sapiens 3 Met Ala Glu Val Leu Arg Thr Leu Ala Gly Lys Pro Lys Cys His Ala 1 5 10 15 Leu Arg Pro Met Ile Leu Phe Leu Ile Met Leu Val Leu Val Leu Phe 20 25 30 Gly Tyr Gly Val Leu Ser Pro Arg Ser Leu Met Pro Gly Ser Leu Glu 35 40 45 Arg Gly Phe Cys Met Ala Val Arg Glu Pro Asp His Leu Gln Arg Val 50 55 60 Ser Leu Pro Arg Met Val Tyr Pro Gln Pro Lys Val Leu Thr Pro Trp 65 70 
75 80 Lys Asp Val Leu Val Val Thr Pro Trp Leu Ala Pro Ile Val Trp Glu 85 90 95 Gly Thr Phe Asn Ile Asp Ile Leu Asn Glu Gln Phe Arg Leu Gln Asn 100 105 110 Thr Thr Ile Gly Leu Thr Val Phe Ala Ile Lys Lys Tyr Val Ala Phe 115 120 125 Leu Lys Leu Phe Leu Glu Thr Ala Glu Lys His Phe Met Val Gly His 130 135 140 Arg Val His Tyr Tyr Val Phe Thr Asp Gln Leu Ala Ala Val Pro Arg 145 150 155 160 Val Thr Leu Gly Thr Gly Arg Gln Leu Ser Val Leu Glu Val Arg Ala 165 170 175 Tyr Lys Arg Trp Gln Asp Val Ser Met Arg Arg Met Glu Met Ile Ser 180 185 190 Asp Phe Cys Glu Arg Arg Phe Leu Ser Glu Val Asp Tyr Leu Val Cys 195 200 205 Val Asp Val Asp Met Glu Phe Arg Asp His Val Gly Val Glu Ile Leu 210 215 220 Thr Pro Leu Phe Gly Thr Leu His Pro Gly Phe Tyr Gly Ser Ser Arg 225 230 235 240 Glu Ala Phe Thr Tyr Glu Arg Arg Pro Gln Ser Gln Ala Tyr Ile Pro 245 250 255 Lys Asp Glu Gly Asp Phe Tyr Tyr Leu Gly Gly Phe Phe Gly Gly Ser 260 265 270 Val Gln Glu Val Gln Arg Leu Thr Arg Ala Cys His Gln Ala Met Met 275 280 285 Val Asp Gln Ala Asn Gly Ile Glu Ala Val Trp His Asp Glu Ser His 290 295 300 Leu Asn Lys Tyr Leu Leu Arg His Lys Pro Thr Lys Val Leu Ser Pro 305 310 315 320 Glu Tyr Leu Trp Asp Gln Gln Leu Leu Gly Trp Pro Ala Val Leu Arg 325 330 335 Lys Leu Arg Phe Thr Ala Val Pro Lys Asn His Gln Ala Val Arg Asn 340 345 350 Pro 4 1065 DNA Homo sapiens CDS (1)..(1065) 4 atg gcc gag gtg ttg cgg acg ctg gcc gga aaa cca aaa tgc cac gca 48 Met Ala Glu Val Leu Arg Thr Leu Ala Gly Lys Pro Lys Cys His Ala 1 5 10 15 ctt cga cct atg atc ctt ttc cta ata atg ctt gtc ttg gtc ttg ttt 96 Leu Arg Pro Met Ile Leu Phe Leu Ile Met Leu Val Leu Val Leu Phe 20 25 30 ggt tac ggg gtc cta agc ccc aga agt cta atg cca gga agc ctg gaa 144 Gly Tyr Gly Val Leu Ser Pro Arg Ser Leu Met Pro Gly Ser Leu Glu 35 40 45 cgg ggg ttc tgc atg gct gtt agg gaa cct gac cat ctg cag cgc gtc 192 Arg Gly Phe Cys Met Ala Val Arg Glu Pro Asp His Leu Gln Arg Val 50 55 60 tcg ttg cca agg atg gtc tac ccc cag cca aag gtg ctg aca ccg tgt 240 Ser Leu Pro Arg 
Met Val Tyr Pro Gln Pro Lys Val Leu Thr Pro Cys 65 70 75 80 agg aag gat gtc ctc gtg gtg acc cct tgg ctg gct ccc att gtc tgg 288 Arg Lys Asp Val Leu Val Val Thr Pro Trp Leu Ala Pro Ile Val Trp 85 90 95 gag ggc acg ttc aac atc gac atc ctc aac gag cag ttc agg ctc cag 336 Glu Gly Thr Phe Asn Ile Asp Ile Leu Asn Glu Gln Phe Arg Leu Gln 100 105 110 aac acc acc att ggg tta act gtg ttt gcc atc aag aaa tac gtg gct 384 Asn Thr Thr Ile Gly Leu Thr Val Phe Ala Ile Lys Lys Tyr Val Ala 115 120 125 ttc ctg aag ctg ttc ctg gag acg gcg gag aag cac ttc atg gtg ggc 432 Phe Leu Lys Leu Phe Leu Glu Thr Ala Glu Lys His Phe Met Val Gly 130 135 140 cac cgt gtc cac tac tat gtc ttc acc gac cag ccg gcc gcg gtg ccc 480 His Arg Val His Tyr Tyr Val Phe Thr Asp Gln Pro Ala Ala Val Pro 145 150 155 160 cgc gtg acg ctg ggg acc ggt cgg cag ctg tca gtg ctg gag gtg ggc 528 Arg Val Thr Leu Gly Thr Gly Arg Gln Leu Ser Val Leu Glu Val Gly 165 170 175 gcc tac aag cgc tgg cag gac gtg tcc atg cgc cgc atg gag atg atc 576 Ala Tyr Lys Arg Trp Gln Asp Val Ser Met Arg Arg Met Glu Met Ile 180 185 190 agt gac ttc tgc gag cgg cgc ttc ctc agc gag gtg gat tac ctg gtg 624 Ser Asp Phe Cys Glu Arg Arg Phe Leu Ser Glu Val Asp Tyr Leu Val 195 200 205 tgc gtg gac gtg gac atg gag ttc cgc gac cat gtg ggc gtg gag atc 672 Cys Val Asp Val Asp Met Glu Phe Arg Asp His Val Gly Val Glu Ile 210 215 220 ctg act ccg ctg ttc ggc acc ctg cac ccc agc ttc tac gga agc agc 720 Leu Thr Pro Leu Phe Gly Thr Leu His Pro Ser Phe Tyr Gly Ser Ser 225 230 235 240 cgg gag gcc ttc acc tac gag cgc cgg ccc cag tcc cag gcc tac atc 768 Arg Glu Ala Phe Thr Tyr Glu Arg Arg Pro Gln Ser Gln Ala Tyr Ile 245 250 255 ccc aag gac gag ggc gat ttc tac tac atg ggg gcg ttc ttc ggg ggg 816 Pro Lys Asp Glu Gly Asp Phe Tyr Tyr Met Gly Ala Phe Phe Gly Gly 260 265 270 tcg gtg caa gag gtg cag cgg ctc acc agg gcc tgc cac cag gcc atg 864 Ser Val Gln Glu Val Gln Arg Leu Thr Arg Ala Cys His Gln Ala Met 275 280 285 atg gtc gac cag gcc aac ggc atc gag gcc gtg tgg cac gac gag 
agc 912 Met Val Asp Gln Ala Asn Gly Ile Glu Ala Val Trp His Asp Glu Ser 290 295 300 cac ctg aac aag tac cta ctg cgc cac aaa ccc acc aag gtg ctc tcc 960 His Leu Asn Lys Tyr Leu Leu Arg His Lys Pro Thr Lys Val Leu Ser 305 310 315 320 ccc gag tac ttg tgg gac cag cag ctg ctg ggc tgg ccc gcc gtc ctg 1008 Pro Glu Tyr Leu Trp Asp Gln Gln Leu Leu Gly Trp Pro Ala Val Leu 325 330 335 agg aag ctg agg ttc act gcg gtg ccc aag aac cac cag gcg gtc cgg 1056 Arg Lys Leu Arg Phe Thr Ala Val Pro Lys Asn His Gln Ala Val Arg 340 345 350 aac ccg tga 1065 Asn Pro 5 354 PRT Homo sapiens 5 Met Ala Glu Val Leu Arg Thr Leu Ala Gly Lys Pro Lys Cys His Ala 1 5 10 15 Leu Arg Pro Met Ile Leu Phe Leu Ile Met Leu Val Leu Val Leu Phe 20 25 30 Gly Tyr Gly Val Leu Ser Pro Arg Ser Leu Met Pro Gly Ser Leu Glu 35 40 45 Arg Gly Phe Cys Met Ala Val Arg Glu Pro Asp His Leu Gln Arg Val 50 55 60 Ser Leu Pro Arg Met Val Tyr Pro Gln Pro Lys Val Leu Thr Pro Cys 65 70 75 80 Arg Lys Asp Val Leu Val Val Thr Pro Trp Leu Ala Pro Ile Val Trp 85 90 95 Glu Gly Thr Phe Asn Ile Asp Ile Leu Asn Glu Gln Phe Arg Leu Gln 100 105 110 Asn Thr Thr Ile Gly Leu Thr Val Phe Ala Ile Lys Lys Tyr Val Ala 115 120 125 Phe Leu Lys Leu Phe Leu Glu Thr Ala Glu Lys His Phe Met Val Gly 130 135 140 His Arg Val His Tyr Tyr Val Phe Thr Asp Gln Pro Ala Ala Val Pro 145 150 155 160 Arg Val Thr Leu Gly Thr Gly Arg Gln Leu Ser Val Leu Glu Val Gly 165 170 175 Ala Tyr Lys Arg Trp Gln Asp Val Ser Met Arg Arg Met Glu Met Ile 180 185 190 Ser Asp Phe Cys Glu Arg Arg Phe Leu Ser Glu Val Asp Tyr Leu Val 195 200 205 Cys Val Asp Val Asp Met Glu Phe Arg Asp His Val Gly Val Glu Ile 210 215 220 Leu Thr Pro Leu Phe Gly Thr Leu His Pro Ser Phe Tyr Gly Ser Ser 225 230 235 240 Arg Glu Ala Phe Thr Tyr Glu Arg Arg Pro Gln Ser Gln Ala Tyr Ile 245 250 255 Pro Lys Asp Glu Gly Asp Phe Tyr Tyr Met Gly Ala Phe Phe Gly Gly 260 265 270 Ser Val Gln Glu Val Gln Arg Leu Thr Arg Ala Cys His Gln Ala Met 275 280 285 Met Val Asp Gln Ala Asn Gly Ile Glu Ala Val Trp His Asp Glu Ser 
290 295 300 His Leu Asn Lys Tyr Leu Leu Arg His Lys Pro Thr Lys Val Leu Ser 305 310 315 320 Pro Glu Tyr Leu Trp Asp Gln Gln Leu Leu Gly Trp Pro Ala Val Leu 325 330 335 Arg Lys Leu Arg Phe Thr Ala Val Pro Lys Asn His Gln Ala Val Arg 340 345 350 Asn Pro 6 376 PRT Platyrrhinus helleri 6 Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr 1 5 10 15 Val Ile Val Val Phe Trp Glu Tyr Ile Asn Ser Pro Glu Gly Ser Phe 20 25 30 Leu Trp Ile Tyr His Ser Lys Asn Pro Glu Val Asp Asp Ser Ser Ala 35 40 45 Gln Lys Asp Trp Trp Phe Pro Gly Trp Phe Asn Asn Gly Ile His Asn 50 55 60 Tyr Gln Gln Glu Glu Glu Asp Thr Asp Lys Glu Lys Gly Arg Glu Glu 65 70 75 80 Glu Gln Lys Lys Glu Asp Asp Thr Thr Glu Leu Arg Leu Trp Asp Trp 85 90 95 Phe Asn Pro Lys Lys Arg Pro Glu Val Met Thr Val Thr Gln Trp Lys 100 105 110 Ala Pro Val Val Trp Glu Gly Thr Tyr Asn Lys Ala Ile Leu Glu Asn 115 120 125 Tyr Tyr Ala Lys Gln Lys Ile Thr Val Gly Leu Thr Val Phe Ala Ile 130 135 140 Gly Arg Tyr Ile Glu His Tyr Leu Glu Glu Phe Val Thr Ser Ala Asn 145 150 155 160 Arg Tyr Phe Met Val Gly His Lys Val Ile Phe Tyr Val Met Val Asp 165 170 175 Asp Val Ser Lys Ala Pro Phe Ile Glu Leu Gly Pro Leu Arg Ser Phe 180 185 190 Lys Val Phe Glu Val Lys Pro Glu Lys Arg Trp Gln Asp Ile Ser Met 195 200 205 Met Arg Met Lys Thr Ile Gly Glu His Ile Leu Ala His Ile Gln His 210 215 220 Glu Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp 225 230 235 240 His Phe Gly Val Glu Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Ala 245 250 255 Trp Trp Tyr Lys Ala Asp Pro Asp Asp Phe Thr Tyr Glu Arg Arg Lys 260 265 270 Glu Ser Ala Ala Tyr Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His 275 280 285 Ala Ala Ile Phe Gly Gly Thr Pro Ile Gln Val Leu Asn Ile Thr Gln 290 295 300 Glu Cys Phe Lys Gly Ile Leu Leu Asp Lys Lys Asn Asp Ile Glu Ala 305 310 315 320 Glu Trp His Asp Glu Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys 325 330 335 Pro Ser Lys Ile Leu Ser Pro Glu Tyr Cys Trp Asp Tyr His Ile Gly 340 345 350 Leu Pro Ser Asp Ile Lys Thr Val Lys Leu Ser 
Trp Gln Thr Lys Glu 355 360 365 Tyr Asn Leu Val Arg Lys Asn Val 370 375 7 227 PRT Homo sapiens 7 Arg Tyr Asn Asp His Tyr Leu Glu Glu Phe Ile Thr Ser Ala Asn Arg 1 5 10 15 Tyr Phe Met Val Gly His Lys Val Ile Phe Tyr Ile Met Val Asp Asp 20 25 30 Val Ser Lys Leu Pro Phe Ile Glu Leu Gly Pro Leu His Ser Phe Lys 35 40 45 Met Phe Glu Val Lys Pro Glu Lys Arg Trp Gln Asp Ile Ser Met Met 50 55 60 Arg Met Lys Ile Thr Gly Glu His Ile Leu Ala His Ile Gln His Glu 65 70 75 80 Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp His 85 90 95 Phe Gly Val Glu Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Trp Arg 100 105 110 Tyr Lys Ala Asp Pro Tyr Asp Phe Thr Glu Arg Trp Lys Glu Ser Ala 115 120 125 Gly Tyr Ile Pro Phe Gly Gly Asp Phe Tyr Tyr His Ala Ala Ile Ser 130 135 140 Gly Gly Thr Pro Ile Gln Val Leu Asn Ile Thr Gln Glu Cys Phe Lys 145 150 155 160 Gly Ile Leu Leu Asp Lys Lys Asn Asp Ile Glu Ala Lys Trp His Asp 165 170 175 Glu Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys Pro Ser Lys Ile 180 185 190 Leu Ser Leu Lys Tyr Cys Trp Asp Tyr His Ile Gly Leu Pro Ser Asp 195 200 205 Ile Lys Thr Val Lys Ser Trp Gln Thr Lys Glu Tyr Asn Leu Val Arg 210 215 220 Asn Asn Val 225 8 369 PRT Ovis aries 8 Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr 1 5 10 15 Val Ile Val Val Phe Trp Glu Tyr Ile His Ser Pro Glu Gly Ser Leu 20 25 30 Phe Trp Ile Asn Pro Ser Arg Asn Pro Glu Val Ser Gly Gly Ser Ser 35 40 45 Ile Gln Lys Gly Trp Trp Phe Pro Arg Trp Phe Asn Asn Gly Tyr Gln 50 55 60 Glu Glu Asp Glu Asp Val Asp Glu Glu Lys Glu Gln Arg Lys Glu Asp 65 70 75 80 Lys Ser Lys Leu Lys Leu Ser Asp Trp Phe Asn Pro Phe Lys Arg Pro 85 90 95 Glu Val Val Thr Met Thr Asp Trp Lys Ala Pro Val Val Trp Glu Gly 100 105 110 Thr Tyr Asn Arg Ala Val Leu Asp Asp Tyr Tyr Ala Lys Gln Lys Ile 115 120 125 Thr Val Gly Leu Thr Val Phe Ala Val Gly Arg Tyr Ile Glu His Tyr 130 135 140 Leu Glu Glu Phe Leu Thr Ser Ala Asn Lys His Phe Met Val Gly His 145 150 155 160 Arg Val Ile Phe Tyr Val Met Val Asp Asp Val Ser Arg Met Pro 
Leu 165 170 175 Ile Glu Leu Gly Pro Leu Arg Ser Phe Lys Val Phe Glu Val Lys Pro 180 185 190 Glu Arg Arg Trp Gln Asp Val Ser Met Val Arg Met Lys Thr Ile Gly 195 200 205 Glu His Ile Val Ala His Ile Gln Arg Glu Val Asp Phe Leu Phe Cys 210 215 220 Met Asp Val Asp Gln Val Phe Gln Asp Glu Phe Gly Val Glu Thr Leu 225 230 235 240 Gly Glu Ser Val Ala Gln Leu Gln Ala Trp Trp Tyr Lys Ala Asp Pro 245 250 255 Asp Glu Phe Thr Tyr Glu Arg Arg Lys Glu Ser Ala Ala Tyr Ile Pro 260 265 270 Phe Gly Glu Gly Asp Phe Tyr Tyr His Ala Ala Ile Phe Gly Gly Thr 275 280 285 Pro Thr Gln Val Leu Asn Ile Thr Gln Glu Cys Phe Lys Gly Ile Leu 290 295 300 Lys Asp Lys Lys Asn Asp Ile Glu Ala Gln Trp His Asp Glu Ser His 305 310 315 320 Leu Asn Lys Tyr Phe Leu Leu Asn Lys Pro Thr Lys Ile Leu Ser Pro 325 330 335 Glu Tyr Cys Trp Asp Tyr His Ile Gly Leu Pro Ala Asp Ile Lys Leu 340 345 350 Val Lys Met Ser Trp Gln Thr Lys Glu Tyr Asn Val Val Arg Asn Asn 355 360 365 Val 9 368 PRT Bos taurus 9 Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr 1 5 10 15 Val Ile Val Val Phe Trp Glu Tyr Ile His Ser Pro Glu Gly Ser Leu 20 25 30 Phe Trp Ile Asn Pro Ser Arg Asn Pro Glu Val Gly Gly Ser Ser Ile 35 40 45 Gln Lys Gly Trp Trp Leu Pro Arg Trp Phe Asn Asn Gly Tyr His Glu 50 55 60 Glu Asp Gly Asp Ile Asn Glu Glu Lys Glu Gln Arg Asn Glu Asp Glu 65 70 75 80 Ser Lys Leu Lys Leu Ser Asp Trp Phe Asn Pro Phe Lys Arg Pro Glu 85 90 95 Val Val Thr Met Thr Lys Trp Lys Ala Pro Val Val Trp Glu Gly Thr 100 105 110 Tyr Asn Arg Ala Val Leu Asp Asn Tyr Tyr Ala Lys Gln Lys Ile Thr 115 120 125 Val Gly Leu Thr Val Phe Ala Val Gly Arg Tyr Ile Glu His Tyr Leu 130 135 140 Glu Glu Phe Leu Thr Ser Ala Asn Lys His Phe Met Val Gly His Pro 145 150 155 160 Val Ile Phe Tyr Ile Met Val Asp Asp Val Ser Arg Met Pro Leu Ile 165 170 175 Glu Leu Gly Pro Leu Arg Ser Phe Lys Val Phe Lys Ile Lys Pro Glu 180 185 190 Lys Arg Trp Gln Asp Ile Ser Met Met Arg Met Lys Thr Ile Gly Glu 195 200 205 His Ile Val Ala His Ile Gln His Glu Val Asp Phe Leu Phe 
Cys Met 210 215 220 Asp Val Asp Gln Val Phe Gln Asp Lys Phe Gly Val Glu Thr Leu Gly 225 230 235 240 Glu Ser Val Ala Gln Leu Gln Ala Trp Trp Tyr Lys Ala Asp Pro Asn 245 250 255 Asp Phe Thr Tyr Glu Arg Arg Lys Glu Ser Ala Ala Tyr Ile Pro Phe 260 265 270 Gly Glu Gly Asp Phe Tyr Tyr His Ala Ala Ile Phe Gly Gly Thr Pro 275 280 285 Thr Gln Val Leu Asn Ile Thr Gln Glu Cys Phe Lys Gly Ile Leu Lys 290 295 300 Asp Lys Lys Asn Asp Ile Glu Ala Gln Trp His Asp Glu Ser His Leu 305 310 315 320 Asn Lys Tyr Phe Leu Leu Asn Lys Pro Thr Lys Ile Leu Ser Pro Glu 325 330 335 Tyr Cys Trp Asp Tyr His Ile Gly Leu Pro Ala Asp Ile Lys Leu Val 340 345 350 Lys Met Ser Trp Gln Thr Lys Glu Tyr Asn Val Val Arg Asn Asn Val 355 360 365 10 371 PRT Sus scrofa 10 Met Asn Val Lys Gly Arg Val Val Leu Ser Met Leu Leu Val Ser Thr 1 5 10 15 Val Met Val Val Phe Trp Glu Tyr Ile Asn Ser Pro Glu Gly Ser Leu 20 25 30 Phe Trp Ile Tyr Gln Ser Lys Asn Pro Glu Val Gly Ser Ser Ala Gln 35 40 45 Arg Gly Trp Trp Phe Pro Ser Trp Phe Asn Asn Gly Thr His Ser Tyr 50 55 60 His Glu Glu Glu Asp Ala Ile Gly Asn Glu Lys Glu Gln Arg Lys Glu 65 70 75 80 Asp Asn Arg Gly Glu Leu Pro Leu Val Asp Trp Phe Asn Pro Glu Lys 85 90 95 Arg Pro Glu Val Val Thr Ile Thr Arg Trp Lys Ala Pro Val Val Trp 100 105 110 Glu Gly Thr Tyr Asn Arg Ala Val Leu Asp Asn Tyr Tyr Ala Lys Gln 115 120 125 Lys Ile Thr Val Gly Leu Thr Val Phe Ala Val Gly Arg Tyr Ile Glu 130 135 140 His Tyr Leu Glu Glu Phe Leu Ile Ser Ala Asn Thr Tyr Phe Met Val 145 150 155 160 Gly His Lys Val Ile Phe Tyr Ile Met Val Asp Asp Ile Ser Arg Met 165 170 175 Pro Leu Ile Glu Leu Gly Pro Leu Arg Ser Phe Lys Val Phe Glu Ile 180 185 190 Lys Ser Glu Lys Arg Trp Gln Asp Ile Ser Met Met Arg Met Lys Thr 195 200 205 Ile Gly Glu His Ile Leu Ala His Ile Gln His Glu Val Asp Phe Leu 210 215 220 Phe Cys Met Asp Val Asp Gln Val Phe Gln Asn Asn Phe Gly Val Glu 225 230 235 240 Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Ala Trp Trp Tyr Lys Ala 245 250 255 His Pro Asp Glu Phe Thr Tyr Glu Arg Arg Lys Glu Ser 
Ala Ala Tyr 260 265 270 Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His Ala Ala Ile Phe Gly 275 280 285 Gly Thr Pro Thr Gln Val Leu Asn Ile Thr Gln Glu Cys Phe Lys Gly 290 295 300 Ile Leu Gln Asp Lys Glu Asn Asp Ile Glu Ala Glu Trp His Asp Glu 305 310 315 320 Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys Pro Thr Lys Ile Leu 325 330 335 Ser Pro Glu Tyr Cys Trp Asp Tyr His Ile Gly Met Ser Val Asp Ile 340 345 350 Arg Ile Val Lys Ile Ala Trp Gln Lys Lys Glu Tyr Asn Leu Val Arg 355 360 365 Asn Asn Ile 370 11 359 PRT Mus musculus 11 Met Asn Val Lys Gly Lys Val Ile Leu Leu Met Leu Ile Val Ser Thr 1 5 10 15 Val Val Val Val Phe Trp Glu Tyr Val Asn Arg Ile Pro Glu Val Gly 20 25 30 Glu Asn Arg Trp Gln Lys Asp Trp Trp Phe Pro Ser Trp Phe Lys Asn 35 40 45 Gly Thr His Ser Tyr Gln Glu Asp Asn Val Glu Gly Arg Arg Glu Lys 50 55 60 Gly Arg Asn Gly Asp Arg Ile Glu Glu Pro Gln Leu Trp Asp Trp Phe 65 70 75 80 Asn Pro Lys Asn Arg Pro Asp Val Leu Thr Val Thr Pro Trp Lys Ala 85 90 95 Pro Ile Val Trp Glu Gly Thr Tyr Asp Thr Ala Leu Leu Glu Lys Tyr 100 105 110 Tyr Ala Thr Gln Lys Leu Thr Val Gly Leu Thr Val Phe Ala Val Gly 115 120 125 Lys Tyr Ile Glu His Tyr Leu Glu Asp Phe Leu Glu Ser Ala Asp Met 130 135 140 Tyr Phe Met Val Gly His Arg Val Ile Phe Tyr Val Met Ile Asp Asp 145 150 155 160 Thr Ser Arg Met Pro Val Val His Leu Asn Pro Leu His Ser Leu Gln 165 170 175 Val Phe Glu Ile Arg Ser Glu Lys Arg Trp Gln Asp Ile Ser Met Met 180 185 190 Arg Met Lys Thr Ile Gly Glu His Ile Leu Ala His Ile Gln His Glu 195 200 205 Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp Asn 210 215 220 Phe Gly Val Glu Thr Leu Gly Gln Leu Val Ala Gln Leu Gln Ala Trp 225 230 235 240 Trp Tyr Lys Ala Ser Pro Glu Lys Phe Thr Tyr Glu Arg Arg Glu Leu 245 250 255 Ser Ala Ala Tyr Ile Pro Phe Gly Glu Gly Asp Phe Tyr Tyr His Ala 260 265 270 Ala Ile Phe Gly Gly Thr Pro Thr His Ile Leu Asn Leu Thr Arg Glu 275 280 285 Cys Phe Lys Gly Ile Leu Gln Asp Lys Lys His Asp Ile Glu Ala Gln 290 295 300 Trp His Asp Glu Ser His Leu Asn 
Lys Tyr Phe Leu Phe Asn Lys Pro 305 310 315 320 Thr Lys Ile Leu Ser Pro Glu Tyr Cys Trp Asp Tyr Gln Ile Gly Leu 325 330 335 Pro Ser Asp Ile Lys Ser Val Lys Val Ala Trp Gln Thr Lys Glu Tyr 340 345 350 Asn Leu Val Arg Asn Asn Val 355 12 376 PRT Artificial Sequence Consensus of mammalian galactosyl transferase sequences - this invention 12 Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr 1 5 10 15 Val Ile Val Val Phe Trp Glu Tyr Ile Asn Ser Pro Glu Gly Ser Phe 20 25 30 Leu Trp Ile Tyr His Ser Lys Asn Pro Glu Val Asp Asp Ser Ser Ala 35 40 45 Gln Lys Asp Trp Trp Phe Pro Gly Trp Phe Asn Asn Gly Ile His Asn 50 55 60 Tyr Gln Gln Glu Glu Glu Asp Thr Asp Lys Glu Lys Gly Arg Glu Glu 65 70 75 80 Glu Gln Lys Lys Glu Asp Asp Thr Thr Glu Leu Arg Leu Trp Asp Trp 85 90 95 Phe Asn Pro Lys Lys Arg Pro Glu Val Met Thr Val Thr Gln Trp Lys 100 105 110 Ala Pro Val Val Trp Glu Gly Thr Tyr Asn Lys Ala Ile Leu Glu Asn 115 120 125 Tyr Tyr Ala Lys Gln Lys Ile Thr Val Gly Leu Thr Val Phe Ala Ile 130 135 140 Gly Arg Tyr Ile Glu His Tyr Leu Glu Glu Phe Leu Thr Ser Ala Asn 145 150 155 160 Arg Tyr Phe Met Val Gly His Lys Val Ile Phe Tyr Val Met Val Asp 165 170 175 Asp Val Ser Lys Ala Pro Phe Ile Glu Leu Gly Pro Leu Arg Ser Phe 180 185 190 Lys Val Phe Glu Val Lys Pro Glu Lys Arg Trp Gln Asp Ile Ser Met 195 200 205 Met Arg Met Lys Thr Ile Gly Glu His Ile Leu Ala His Ile Gln His 210 215 220 Glu Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp 225 230 235 240 His Phe Gly Val Glu Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Ala 245 250 255 Trp Trp Tyr Lys Ala Asp Pro Asp Asp Phe Thr Tyr Glu Arg Arg Lys 260 265 270 Glu Ser Ala Ala Tyr Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His 275 280 285 Ala Ala Ile Phe Gly Gly Thr Pro Ile Gln Val Leu Asn Ile Thr Gln 290 295 300 Glu Cys Phe Lys Gly Ile Leu Leu Asp Lys Lys Asn Asp Ile Glu Ala 305 310 315 320 Glu Trp His Asp Glu Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys 325 330 335 Pro Ser Lys Ile Leu Ser Pro Glu Tyr Cys Trp Asp Tyr His Ile Gly
340 345 350 Leu Pro Ser Asp Ile Lys Thr Val Lys Leu Ser Trp Gln Thr Lys Glu 355 360 365 Tyr Asn Leu Val Arg Lys Asn Val 370 375 13 376 PRT Artificial Sequence Humanized galactosyl transferase sequence - This invention 13 Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr 1 5 10 15 Val Ile Val Val Phe Trp Glu Tyr Ile Asn Ser Pro Glu Gly Ser Phe 20 25 30 Leu Trp Ile Tyr His Ser Lys Asn Pro Glu Val Asp Asp Ser Ser Ala 35 40 45 Gln Lys Asp Trp Trp Phe Pro Gly Trp Phe Asn Asn Gly Ile His Asn 50 55 60 Tyr Gln Gln Glu Glu Glu Asp Thr Asp Lys Glu Lys Gly Arg Glu Glu 65 70 75 80 Glu Gln Lys Lys Glu Asp Asp Thr Thr Glu Leu Arg Leu Trp Asp Trp 85 90 95 Phe Asn Pro Lys Lys Arg Pro Glu Val Met Thr Val Thr Gln Trp Lys 100 105 110 Ala Pro Val Val Trp Glu Gly Thr Tyr Asn Lys Ala Ile Leu Glu Asn 115 120 125 Tyr Tyr Ala Lys Gln Lys Ile Thr Val Gly Leu Thr Val Phe Ala Ile 130 135 140 Gly Arg Tyr Ile Asp His Tyr Leu Glu Glu Phe Leu Thr Ser Ala Asn 145 150 155 160 Arg Tyr Phe Met Val Gly His Lys Val Ile Phe Tyr Ile Met Val Asp 165 170 175 Asp Val Ser Lys Ala Pro Phe Ile Glu Leu Gly Pro Leu Arg Ser Phe 180 185 190 Lys Val Phe Glu Val Lys Pro Glu Lys Arg Trp Gln Asp Ile Ser Met 195 200 205 Met Arg Met Lys Ile Thr Gly Glu His Ile Leu Ala His Ile Gln His 210 215 220 Glu Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp 225 230 235 240 His Phe Gly Val Glu Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Ala 245 250 255 Trp Trp Tyr Lys Ala Asp Pro Asp Asp Phe Thr Tyr Glu Arg Arg Lys 260 265 270 Glu Ser Ala Gly Tyr Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His 275 280 285 Ala Ala Ile Phe Gly Gly Thr Pro Ile Gln Val Leu Asn Ile Thr Gln 290 295 300 Glu Cys Phe Lys Gly Ile Leu Leu Asp Lys Lys Asn Asp Ile Glu Ala 305 310 315 320 Glu Trp His Asp Glu Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys 325 330 335 Pro Ser Lys Ile Leu Ser Pro Glu Tyr Cys Trp Asp Tyr His Ile Gly 340 345 350 Leu Pro Ser Asp Ile Lys Thr Val Lys Leu Ser Trp Gln Thr Lys Glu 355 360 365 Tyr Asn Leu Val Arg Lys Asn Val 
370 375

14 1131 DNA Platyrrhinus helleri 14
atgaatgtca aaggaaaagt aattctgtcg atgctggttg tctcaactgt gattgttgtg 60
ttttgggaat atatcaacag cccagaaggc tctttcttgt ggatatatca ctcaaagaac 120
ccagaagttg atgacagcag tgctcagaag gactggtggt ttcctggctg gtttaacaat 180
gggatccaca attatcaaca agaggaagaa gacacagaca aagaaaaagg aagagaggag 240
gaacaaaaaa aggaagatga cacaacagag cttcggctat gggactggtt taatccaaag 300
aaacgcccag aggttatgac agtgacccaa tggaaggcgc cggttgtgtg ggaaggcact 360
tacaacaaag ccatcctaga aaattattat gccaaacaga aaattaccgt ggggttgacg 420
gtttttgcta ttggaagata tattgagcat tacttggagg agttcgtaac atctgctaat 480
aggtacttca tggtcggcca caaagtcata ttttatgtca tggtggatga tgtctccaag 540
gcgccgttta tagagctggg tcctctgcgt tccttcaaag tgtttgaggt caagccagag 600
aagaggtggc aagacatcag catgatgcgt atgaagacca tcggggagca catcttggcc 660
cacatccaac acgaggttga cttcctcttc tgcatggatg tggaccaggt cttccaagac 720
cattttgggg tagagaccct gggccagtcg gtggctcagc tacaggcctg gtggtacaag 780
gcagatcctg atgactttac ctatgagagg cggaaagagt cggcagcata tattccattt 840
ggccaggggg atttttatta ccatgcagcc atttttggag gaacaccgat tcaggttctc 900
aacatcaccc aggagtgctt taagggaatc ctcctggaca agaaaaatga catagaagcc 960
gagtggcatg atgaaagcca cctaaacaag tatttccttc tcaacaaacc ctctaaaatc 1020
ttatctccag aatactgctg ggattatcat ataggcctgc cttcagatat taaaactgtc 1080
aagctatcat ggcaaacaaa agagtataat ttggttagaa agaatgtctg a 1131

15 755 DNA Homo sapiens 15
cagcttgtgg tttctttcag gaatcccaga ggataaatgt tttgcttttc ttctttgttt 60
cagatataat gatcattact tggaggagtt cataacatct gctaataggt acttcatggt 120
tggccacaaa gtcatatttt acatcatggt ggatgatgtc tccaagctgc cgtttataga 180
gctgggtcct ctgcattcct tcaaaatgtt tgaggtcaag ccagagaaga ggtggcaaga 240
catcagcatg atgcgtatga agatcactgg ggagcacatc ttggcccaca tccaacacga 300
ggtcgacttc ctcttctgca tggatgtgga ccaggtcttc caagaccatt ttggggtgga 360
gaccctaggc cagtcagtgg ctcagctaca ggctggcggt acaaggcaga tccctatgac 420
tttacctagg agaggtggaa agagtcagca ggatacattc catttggcca ggggattttt 480
attaccatgc agccatttct ggaggaacac ccattcaggt tctcaacatc acccaggagt 540
gctttaaggg aatcctcctg gacaagaaaa atgacataga agccaagtgg catgatgaaa 600
gccacctaaa caagtatttc cttctcaata aaccctctaa aatcttatcc ctaaaatact 660
gctgggatta tcatataggc ctgccttcag atattaaaac tgtcaagtga tcgtggcaga 720
caaaagagta taatttggtt agaaataatg tctga 755

16 1131 DNA Artificial Sequence Humanized galactosyl transferase sequence - This invention 16
atgaatgtca aaggaaaagt aattctgtcg atgctggttg tctcaactgt gattgttgtg 60
ttttgggaat atatcaacag cccagaaggc tctttcttgt ggatatatca ctcaaagaac 120
ccagaagttg atgacagcag tgctcagaag gactggtggt ttcctggctg gtttaacaat 180
gggatccaca attatcaaca agaggaagaa gacacagaca aagaaaaagg aagagaggag 240
gaacaaaaaa aggaagatga cacaacagag cttcggctat gggactggtt taatccaaag 300
aaacgcccag aggttatgac agtgacccaa tggaaggcgc cggttgtgtg ggaaggcact 360
tacaacaaag ccatcctaga aaattattat gccaaacaga aaattaccgt ggggttgacg 420
gtttttgcta ttggaagata tattgatcat tacttggagg agttcttaac atctgctaat 480
aggtacttca tggttggcca caaagtcata ttttacatca tggtggatga tgtctccaag 540
gcgccgttta tagagctggg tcctctgcgt tccttcaaag tgtttgaggt caagccagag 600
aagaggtggc aagacatcag catgatgcgt atgaagatca ctggggagca catcttggcc 660
cacatccaac acgaggtcga cttcctcttc tgcatggatg tggaccaggt cttccaagac 720
cattttgggg tggagaccct aggccagtca gtggctcagc tacaggcctg gtggtacaag 780
gcagatcccg atgactttac ctatgagagg cggaaagagt cagcaggata cattccattt 840
ggccaggggg atttttatta ccatgcagcc atttttggag gaacacccat tcaggttctc 900
aacatcaccc aggagtgctt taagggaatc ctcctggaca agaaaaatga catagaagcc 960
gagtggcatg atgaaagcca cctaaacaag tatttccttc tcaataaacc ctctaaaatc 1020
ttatccccag aatactgctg ggattatcat ataggcctgc cttcagatat taaaactgtc 1080
aagctatcgt ggcagacaaa agagtataat ttggttagaa ataatgtctg a 1131

17 1303 DNA Ovis aries 17
agccgaggac gccgccgggg agccgaggct ccggccagcc cccagcgcgc ccagcttctg 60
cagatcagga gtcagaacgc tgcaccttcg cttcctccca gccctgcctc cttctgcaaa 120
acggagctca atagaacttg gtacttttgc cttttactct gggaggagag aagcagacga 180
tgaggagaaa ataatgaatg tcaaaggaaa agtgattctg tcaatgctgg ttgtctcaac 240
tgtcattgtt gtgttttggg aatatatcca cagcccagaa ggctctttgt tctggataaa 300
cccatcaaga aacccagaag tcagtggcgg cagcagcatt cagaagggct ggtggtttcc 360
gagatggttt aacaatggtt accaagaaga agatgaagac gtagacgaag aaaaggaaca 420
aagaaaggaa gacaaaagca agcttaagct atcggactgg ttcaacccat ttaaacgccc 480
tgaggttgtg actatgacag attggaaggc acccgtggtg tgggaaggca cttacaacag 540
agccgtctta gacgattact acgccaagca gaaaattacc gtcggcctga cggttttcgc 600
cgtcggaaga tacattgagc attacttgga ggagttctta acgtctgcta ataagcactt 660
catggttggc caccgagtca tcttttacgt catggtggac gacgtctcca ggatgccttt 720
gatagagctg ggccctctgc gctccttcaa agtgtttgag gtcaagcctg agaggaggtg 780
gcaggacgtc agcatggtgc gcatgaagac catcggggag cacatcgtgg cccacatcca 840
gcgtgaggtt gacttcctct tctgcatgga cgtggaccag gtcttccaag acgagttcgg 900
ggtggagacc ctgggtgagt cggtggccca gctacaggcc tggtggtaca aggcagatcc 960
cgatgagttt acctacgaga ggcgcaagga gtctgcagca tacattccct tcggcgaagg 1020
ggatttttat taccacgcag ccatttttgg gggaacaccc actcaggtcc ttaacatcac 1080
ccaggaatgc ttcaaaggaa tcctcaagga caagaaaaat gacatagaag cccaatggca 1140
tgatgagagc catctaaaca agtatttcct tctcaacaaa cccactaaaa tcttatcccc 1200
ggaatactgc tgggattatc atataggcct acctgcggat attaagcttg tcaagatgtc 1260
ttggcagaca aaagagtata atgtggttag aaataacgtc tga 1303

What is claimed as the invention is:
 1. A polynucleotide comprising an encoding sequence for a glycosyltransferase under control of a heterologous tumor specific or tissue specific transcriptional control element, wherein expression of the polynucleotide in a human cell causes the cell to express a cell-surface carbohydrate determinant to which some or all humans have naturally occurring antibody.
 2. The polynucleotide of claim 1, wherein the glycosyltransferase is a blood group A transferase.
 3. The polynucleotide of claim 1, wherein the glycosyltransferase is a blood group B transferase.
 4. The polynucleotide of claim 1, wherein the glycosyltransferase is an α(1,3)galactosyltransferase.
 5. The polynucleotide of claim 1, wherein the transcriptional control element is a tissue specific promoter, which is a promoter for albumin, α-fetoprotein, prostate-specific antigen (PSA), mitochondrial creatine kinase (MCK), myelin basic protein (MB), glial fibrillary acidic protein (GFAP), or neuron-specific enolase (NSE).
 6. The polynucleotide of claim 1, wherein the transcriptional control element is a tumor specific promoter, which is a promoter for telomerase reverse transcriptase (TERT), carcinoembryonic antigen (CEA), hypoxia-responsive element (HRE), Grp78, L-plastin, or hexokinase II.
 7. The polynucleotide of claim 6, wherein the promoter comprises at least 25 consecutive nucleotides in SEQ. ID NO:1.
 8. The polynucleotide of claim 1, wherein the encoding sequence encodes either: a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ. ID NOs:3, 5, 6, 12, and 13; or a fragment thereof having galactosyltransferase activity.
 9. A humanized or consensus α(1,3)galactosyltransferase, comprising the amino acid sequence shown in SEQ. ID NOs:12 or 13, or a fragment thereof having galactosyltransferase activity.
 10. A polynucleotide comprising an encoding sequence for a human ABO histo blood group transferase under control of a tissue or tumor specific transcriptional control element.
 11. The polynucleotide of claim 1, which is a viral vector.
 12. The polynucleotide of claim 11, which is an adenovirus vector.
 13. A human cell containing the polynucleotide of claim 1.
 14. A method of killing a cancer cell, comprising combining the cancer cell with the polynucleotide of claim 1.